HUMAN GENES AND GENE EXPRESSION PRODUCTS XVI

Field of the Invention 

The present invention relates to polynucleotides of human origin and the encoded gene
products.

Background of the Invention 

Identification of novel polynucleotides, particularly those that encode an expressed gene
product, is important in the advancement of drug discovery, diagnostic technologies, and the
understanding of the progression and nature of complex diseases such as cancer. Identification of
genes expressed in different cell types isolated from sources that differ in disease state or stage,
developmental stage, exposure to various environmental factors, the tissue of origin, the species
from which the tissue was isolated, and the like is key to identifying the genetic factors that are
responsible for the phenotypes associated with these various differences.

This invention provides novel human polynucleotides, the polypeptides encoded by these
polynucleotides, and the genes and proteins corresponding to these novel polynucleotides.

Summary of the Invention 

This invention relates to novel human polynucleotides and variants thereof, their encoded
polypeptides and variants thereof, to genes corresponding to these polynucleotides and to proteins
expressed by the genes. The invention also relates to diagnostics and therapeutics comprising such
novel human polynucleotides, their corresponding genes or gene products, including probes,
antisense nucleotides, and antibodies. The polynucleotides of the invention correspond to a
polynucleotide comprising the sequence information of at least one of SEQ ID NOS:1-316.

Various aspects and embodiments of the invention will be readily apparent to the ordinarily
skilled artisan upon reading the description provided herein.

Brief Description of the Figures 

Figures 1A-1B are a comparison of SEQ ID NO:315 and clone H72034 (SEQ ID NO:317).

Figure 2 is a comparison of SEQ ID NO:316 and clone AA707002 (SEQ ID NO:318).

Detailed Description of the Invention 

The invention relates to polynucleotides comprising the disclosed nucleotide sequences, to
full-length cDNA, mRNA, genomic sequences, and genes corresponding to these sequences and
degenerate variants thereof, and to polypeptides encoded by the polynucleotides of the invention
and polypeptide variants. The following detailed description describes the polynucleotide
compositions encompassed by the invention, methods for obtaining cDNA or genomic DNA
encoding a full-length gene product, expression of these polynucleotides and genes, identification
of structural motifs of the polynucleotides and genes, identification of the function of a gene
product encoded by a gene corresponding to a polynucleotide of the invention, use of the provided
polynucleotides as probes and in mapping and in tissue profiling, use of the corresponding
polypeptides and other gene products to raise antibodies, and use of the polynucleotides and their
encoded gene products for therapeutic and diagnostic purposes.

Polynucleotide Compositions 

The scope of the invention with respect to polynucleotide compositions includes, but is not
necessarily limited to, polynucleotides having a sequence set forth in any one of SEQ ID NOS:1-316;
polynucleotides obtained from the biological materials described herein or other biological
sources (particularly human sources) by hybridization under stringent conditions (particularly
conditions of high stringency); genes corresponding to the provided polynucleotides; variants of the
provided polynucleotides and their corresponding genes, particularly those variants that retain a
biological activity of the encoded gene product (e.g., a biological activity ascribed to a gene product
corresponding to the provided polynucleotides as a result of the assignment of the gene product to a
protein family(ies) and/or identification of a functional domain present in the gene product). Other
nucleic acid compositions contemplated by and within the scope of the present invention will be
readily apparent to one of ordinary skill in the art when provided with the disclosure here.

"Polynucleotide" and "nucleic acid" as used herein with reference to nucleic acids of the
composition are not intended to be limiting as to the length or structure of the nucleic acid unless
specifically indicated.

The invention features polynucleotides that are expressed in human tissue, specifically
human colon, breast, and/or lung tissue. Novel nucleic acid compositions of the invention of
particular interest comprise a sequence set forth in any one of SEQ ID NOS:1-316 or an
identifying sequence thereof. An "identifying sequence" is a contiguous sequence of residues at
least about 10 nt to about 20 nt in length, usually at least about 50 nt to about 100 nt in length, that
uniquely identifies a polynucleotide sequence, e.g., exhibits less than 90%, usually less than about
80% to about 85% sequence identity to any contiguous nucleotide sequence of more than about
20 nt. Thus, the subject novel nucleic acid compositions include full length cDNAs or mRNAs that
encompass an identifying sequence of contiguous nucleotides from any one of SEQ ID NOS:1-316.

The polynucleotides of the invention also include polynucleotides having sequence
similarity or sequence identity. Nucleic acids having sequence similarity are detected by
hybridization under low stringency conditions, for example, at 50°C and 10X SSC (0.9 M
saline/0.09 M sodium citrate) and remain bound when subjected to washing at 55°C in 1X SSC.
Sequence identity can be determined by hybridization under stringent conditions, for example, at
50°C or higher and 0.1X SSC (9 mM saline/0.9 mM sodium citrate). Hybridization methods and
conditions are well known in the art, see, e.g., USPN 5,707,829. Nucleic acids that are
substantially identical to the provided polynucleotide sequences, e.g., allelic variants, genetically
altered versions of the gene, etc., bind to the provided polynucleotide sequences (SEQ ID NOS:1-316)
under stringent hybridization conditions. By using probes, particularly labeled probes of DNA
sequences, one can isolate homologous or related genes. The source of homologous genes can be
any species, e.g., primate species, particularly human; rodents, such as rats and mice; canines,
felines, bovines, ovines, equines, yeast, nematodes, etc.

Preferably, hybridization is performed using at least 15 contiguous nucleotides (nt) of at
least one of SEQ ID NOS:1-316. That is, when at least 15 contiguous nt of one of the disclosed
SEQ ID NOS. is used as a probe, the probe will preferentially hybridize with a nucleic acid
comprising the complementary sequence, allowing the identification and retrieval of the nucleic
acids that uniquely hybridize to the selected probe. Probes from more than one SEQ ID NO. can
hybridize with the same nucleic acid if the cDNA from which they were derived corresponds to one
mRNA. Probes of more than 15 nt can be used, e.g., probes of from about 18 nt to about 100 nt,
but 15 nt represents sufficient sequence for unique identification.
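
The notion that a 15 nt probe can uniquely identify its source can be checked computationally. The following is a minimal sketch, not part of the disclosed methods: it lists the 15 nt windows of a target sequence that occur on neither strand of any sequence in a background set. The function names and the idea of comparing against an explicit background list are illustrative assumptions.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


def kmers(seq, k):
    """Return the set of all contiguous k-nt windows of seq."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def unique_probes(target, background, k=15):
    """Return the k-mers of target that are absent from both strands
    of every background sequence (hypothetical helper)."""
    seen = set()
    for other in background:
        seen |= kmers(other, k)
        seen |= kmers(reverse_complement(other), k)
    return sorted(kmers(target, k) - seen)
```

In practice the background would be a sequence database rather than a short list, and an exact-match test is only a rough proxy for hybridization specificity, since mismatched duplexes can also form under permissive wash conditions.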

The polynucleotides of the invention also include naturally occurring variants of the
nucleotide sequences (e.g., degenerate variants, allelic variants, etc.). Variants of the
polynucleotides of the invention are identified by hybridization of putative variants with nucleotide
sequences disclosed herein, preferably by hybridization under stringent conditions. For example,
by using appropriate wash conditions, variants of the polynucleotides of the invention can be
identified where the allelic variant exhibits at most about 25-30% base pair (bp) mismatches
relative to the selected polynucleotide probe. In general, allelic variants contain 15-25% bp
mismatches, and can contain as few as 5-15%, or 2-5%, or 1-2% bp mismatches, or even a
single bp mismatch.
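
The mismatch percentages above can be made concrete with a short calculation. This is a minimal sketch that assumes the probe and the candidate variant have already been aligned to the same length without gaps; the function name is hypothetical.

```python
def percent_mismatch(probe, candidate):
    """Percentage of mismatched positions between two equal-length,
    pre-aligned sequences (gaps are not handled in this sketch)."""
    if len(probe) != len(candidate):
        raise ValueError("sequences must be aligned to the same length")
    if not probe:
        raise ValueError("sequences must not be empty")
    mismatches = sum(
        1 for a, b in zip(probe.upper(), candidate.upper()) if a != b
    )
    return 100.0 * mismatches / len(probe)
```

For example, a single mismatch in a 10 nt probe corresponds to 10% bp mismatches, which falls in the 5-15% range mentioned above.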

The invention also encompasses homologs corresponding to the polynucleotides of SEQ
ID NOS:1-316, where the source of homologous genes can be any mammalian species, e.g.,
primate species, particularly human; rodents, such as rats; canines, felines, bovines, ovines,
equines, yeast, nematodes, etc. Between mammalian species, e.g., human and mouse, homologs
generally have substantial sequence similarity, e.g., at least 75% sequence identity, usually at least
90%, more usually at least 95% between nucleotide sequences. Sequence similarity is calculated
based on a reference sequence, which may be a subset of a larger sequence, such as a conserved
motif, coding region, flanking region, etc. A reference sequence will usually be at least about 18
contiguous nt long, more usually at least about 30 nt long, and may extend to the complete
sequence that is being compared. Algorithms for sequence analysis are known in the art, such as
gapped BLAST, described in Altschul, et al. Nucleic Acids Res. (1997) 25:3389-3402.

In general, variants of the invention have a sequence identity greater than at least about
65%, preferably at least about 75%, more preferably at least about 85%, and can be greater than at
least about 90% or more as determined by the Smith-Waterman homology search algorithm as
implemented in MPSRCH program (Oxford Molecular). For the purposes of this invention, a
preferred method of calculating percent identity is the Smith-Waterman algorithm, using the
following parameters. Global DNA sequence identity must be greater than 65% as determined by the
Smith-Waterman homology search algorithm as implemented in MPSRCH program (Oxford Molecular)
using an affine gap search with the following search parameters: gap open penalty, 12; and gap
extension penalty, 1.
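
To illustrate what an affine gap search with a gap open penalty of 12 and a gap extension penalty of 1 means, the following is a minimal score-only sketch of Smith-Waterman local alignment with affine gaps (the Gotoh recurrence). It is not the MPSRCH implementation. The match and mismatch scores (+5 and -4) and the convention that a gap of length k costs gap_open + (k - 1) * gap_extend are assumptions chosen for illustration.

```python
def smith_waterman_affine(a, b, match=5, mismatch=-4, gap_open=12, gap_extend=1):
    """Best local alignment score of a and b under affine gap penalties.

    Assumed convention: a gap of length k costs
    gap_open + (k - 1) * gap_extend. Scores are illustrative.
    """
    neg_inf = float("-inf")
    cols = len(b) + 1
    # h: best score of an alignment ending at (i, j)
    # e: best score ending with a gap that consumes only b (same row)
    # f: best score ending with a gap that consumes only a (same column)
    h_prev = [0] * cols
    f_prev = [neg_inf] * cols
    best = 0
    for i in range(1, len(a) + 1):
        h_curr = [0] * cols
        f_curr = [neg_inf] * cols
        e = neg_inf
        for j in range(1, cols):
            # Extend an existing gap, or open a new one from the left cell
            e = max(e - gap_extend, h_curr[j - 1] - gap_open)
            # Extend an existing gap, or open a new one from the cell above
            f_curr[j] = max(f_prev[j] - gap_extend, h_prev[j] - gap_open)
            pair = match if a[i - 1] == b[j - 1] else mismatch
            diag = h_prev[j - 1] + pair
            # Local alignment: scores never drop below zero
            h_curr[j] = max(0, diag, e, f_curr[j])
            best = max(best, h_curr[j])
        h_prev, f_prev = h_curr, f_curr
    return best
```

With these assumed scores, two 10 nt blocks that match perfectly but are separated by a 3 nt insertion in one sequence score 20 * 5 - (12 + 2 * 1) = 86. Converting such an alignment into a percent identity would additionally require a traceback, which this sketch omits.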

The subject nucleic acids can be cDNAs or genomic DNAs, as well as fragments thereof,
particularly fragments that encode a biologically active gene product and/or are useful in the
methods disclosed herein (e.g., in diagnosis, as a unique identifier of a differentially expressed gene
of interest, etc.). The term "cDNA" as used herein is intended to include all nucleic acids that share
the arrangement of sequence elements found in native mature mRNA species, where sequence
elements are exons and 3' and 5' non-coding regions. Normally mRNA species have contiguous
exons, with the intervening introns, when present, being removed by nuclear RNA splicing, to
create a continuous open reading frame encoding a polypeptide of the invention.

A genomic sequence of interest comprises the nucleic acid present between the initiation
codon and the stop codon, as defined in the listed sequences, including all of the introns that are
normally present in a native chromosome. It can further include the 3' and 5' untranslated regions
found in the mature mRNA. It can further include specific transcriptional and translational
regulatory sequences, such as promoters, enhancers, etc., including about 1 kb, but possibly more,
of flanking genomic DNA at either the 5' or 3' end of the transcribed region. The genomic DNA
can be isolated as a fragment of 100 kbp or smaller, and substantially free of flanking chromosomal
sequence. The genomic DNA flanking the coding region, either 3' or 5', or internal regulatory
sequences as sometimes found in introns, contains sequences required for proper tissue-specific,
stage-specific, or disease-state specific expression.

The nucleic acid compositions of the subject invention can encode all or a part of the
subject polypeptides. Double or single stranded fragments can be obtained from the DNA sequence
by chemically synthesizing oligonucleotides in accordance with conventional methods, by
restriction enzyme digestion, by PCR amplification, etc. Isolated polynucleotides and
polynucleotide fragments of the invention comprise at least about 10, about 15, about 20, about 35,
about 50, about 100, about 150 to about 200, about 250 to about 300, or about 350 contiguous nt
selected from the polynucleotide sequences as shown in SEQ ID NOS:1-316. For the most part,
fragments will be of at least 15 nt, usually at least 18 nt or 25 nt, and up to at least about 50
contiguous nt in length or more. In a preferred embodiment, the polynucleotide molecules
comprise a contiguous sequence of at least 12 nt selected from the group consisting of the
polynucleotides shown in SEQ ID NOS:1-316.

Probes specific to the polynucleotides of the invention can be generated using the
polynucleotide sequences disclosed in SEQ ID NOS:1-316. The probes are preferably at least
about a 12, 15, 16, 18, 20, 22, 24, or 25 nt fragment of a corresponding contiguous sequence of
SEQ ID NOS:1-316, and can be less than 2, 1, 0.5, 0.1, or 0.05 kb in length. The probes can be
synthesized chemically or can be generated from longer polynucleotides using restriction enzymes.
The probes can be labeled, for example, with a radioactive, biotinylated, or fluorescent tag.
Preferably, probes are designed based upon an identifying sequence of a polynucleotide of one of
SEQ ID NOS:1-316. More preferably, probes are designed based on a contiguous sequence of one
of the subject polynucleotides that remains unmasked following application of a masking program
for masking low complexity (e.g., XBLAST) to the sequence, i.e., one would select an unmasked
region, as indicated by the polynucleotides outside the poly-n stretches of the masked sequence
produced by the masking program.
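
Selecting an unmasked region for probe design can be sketched as follows. The example assumes the masking program replaces low-complexity residues with the letter N, consistent with the "poly-n stretches" described above, and reports every N-free run that is long enough to hold a probe. The function name and the default minimum length of 15 nt are illustrative assumptions.

```python
import re


def unmasked_regions(masked_seq, min_len=15):
    """Return (start, end, subsequence) for each run free of 'N' that is
    at least min_len nt long. Coordinates are 0-based, end-exclusive."""
    regions = []
    for match in re.finditer(r"[^Nn]+", masked_seq):
        if match.end() - match.start() >= min_len:
            regions.append((match.start(), match.end(), match.group()))
    return regions
```

Probes would then be chosen from within the returned regions, for example by taking 15 to 25 nt windows and checking them for uniqueness.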

The polynucleotides of the subject invention are isolated and obtained in substantial purity,
generally as other than an intact chromosome. Usually, the polynucleotides, either as DNA or
RNA, will be obtained substantially free of other naturally-occurring nucleic acid sequences,
generally being at least about 50%, usually at least about 90% pure and are typically
"recombinant", e.g., flanked by one or more nucleotides with which it is not normally associated on
a naturally occurring chromosome.

The polynucleotides of the invention can be provided as a linear molecule or within a
circular molecule, and can be provided within autonomously replicating molecules (vectors) or
within molecules without replication sequences. Expression of the polynucleotides can be
regulated by their own or by other regulatory sequences known in the art. The polynucleotides of
the invention can be introduced into suitable host cells using a variety of techniques available in the
art, such as transferrin polycation-mediated DNA transfer, transfection with naked or encapsulated
nucleic acids, liposome-mediated DNA transfer, intracellular transportation of DNA-coated latex
beads, protoplast fusion, viral infection, electroporation, gene gun, calcium phosphate-mediated
transfection, and the like.

The subject nucleic acid compositions can be used, for example, to produce polypeptides,
as probes for the detection of mRNA of the invention in biological samples (e.g., extracts of human
cells), to generate additional copies of the polynucleotides, to generate ribozymes or antisense
oligonucleotides, and as single stranded DNA probes or as triple-strand forming oligonucleotides.
The probes described herein can be used, for example, to determine the presence or absence of the
polynucleotide sequences as shown in SEQ ID NOS:1-316 or variants thereof in a sample. These
and other uses are described in more detail below.

Use of Polynucleotides to Obtain Full-Length cDNA, Gene, and Promoter Region 

Full-length cDNA molecules comprising the disclosed polynucleotides are obtained as
follows. A polynucleotide having a sequence of one of SEQ ID NOS:1-316, or a portion thereof
comprising at least 12, 15, 18, or 20 nt, is used as a hybridization probe to detect hybridizing
members of a cDNA library using probe design methods, cloning methods, and clone selection
techniques such as those described in USPN 5,654,173. Libraries of cDNA are made from selected
tissues, such as normal or tumor tissue, or from tissues of a mammal treated with, for example, a
pharmaceutical agent. Preferably, the tissue is the same as the tissue from which the
polynucleotides of the invention were isolated, as both the polynucleotides described herein and the
cDNA represent expressed genes. Most preferably, the cDNA library is made from the biological
material described herein in the Examples. The choice of cell type for library construction can be
made after the identity of the protein encoded by the gene corresponding to the polynucleotide of
the invention is known. This will indicate which tissue and cell types are likely to express the
related gene, and thus represent a suitable source for the mRNA for generating the cDNA. Where
the provided polynucleotides are isolated from cDNA libraries, the libraries are prepared from
mRNA of human colon cells, more preferably, human colon cancer cells, even more preferably,
from a highly metastatic colon cell, Km12L4-A.

Techniques for producing and probing nucleic acid sequence libraries are described, for
example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., (1989) Cold
Spring Harbor Press, Cold Spring Harbor, NY. The cDNA can be prepared by using primers based
on sequence from SEQ ID NOS:1-316. In one embodiment, the cDNA library can be made from
only poly-adenylated mRNA. Thus, poly-T primers can be used to prepare cDNA from the mRNA.

Members of the library that are larger than the provided polynucleotides, and preferably
that encompass the complete coding sequence of the native message, are obtained. In order to
confirm that the entire cDNA has been obtained, RNA protection experiments are performed as
follows. Hybridization of a full-length cDNA to an mRNA will protect the RNA from RNase
degradation. If the cDNA is not full length, then the portions of the mRNA that are not hybridized
will be subject to RNase degradation. This is assayed, as is known in the art, by changes in
electrophoretic mobility on polyacrylamide gels, or by detection of released monoribonucleotides.
Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., (1989) Cold Spring Harbor
Press, Cold Spring Harbor, NY. In order to obtain additional sequences 5' to the end of a partial
cDNA, 5' RACE (PCR Protocols: A Guide to Methods and Applications, (1990) Academic Press,
Inc.) can be performed.

Genomic DNA is isolated using the provided polynucleotides in a manner similar to the
isolation of full-length cDNAs. Briefly, the provided polynucleotides, or portions thereof, are used
as probes to libraries of genomic DNA. Preferably, the library is obtained from the cell type that
was used to generate the polynucleotides of the invention, but this is not essential. Most preferably,
the genomic DNA is obtained from the biological material described herein in the Examples. Such
libraries can be in vectors suitable for carrying large segments of a genome, such as P1 or YAC, as
described in detail in Sambrook et al., 9.4-9.30. In addition, genomic sequences can be isolated
from human BAC libraries, which are commercially available from Research Genetics, Inc.,
Huntsville, Alabama, USA, for example. In order to obtain additional 5' or 3' sequences,
chromosome walking is performed, as described in Sambrook et al., such that adjacent and
overlapping fragments of genomic DNA are isolated. These are mapped and pieced together, as is
known in the art, using restriction digestion enzymes and DNA ligase.

Using the polynucleotide sequences of the invention, corresponding full-length genes can
be isolated using both classical and PCR methods to construct and probe cDNA libraries. Using
either method, Northern blots, preferably, are performed on a number of cell types to determine
which cell lines express the gene of interest at the highest level. Classical methods of constructing
cDNA libraries are taught in Sambrook et al., supra. With these methods, cDNA can be produced
from mRNA and inserted into viral or expression vectors. Typically, libraries of mRNA
comprising poly(A) tails can be produced with poly(T) primers. Similarly, cDNA libraries can be
produced using the instant sequences as primers.

PCR methods are used to amplify the members of a cDNA library that comprise the desired
insert. In this case, the desired insert will contain sequence from the full length cDNA that
corresponds to the instant polynucleotides. Such PCR methods include gene trapping and RACE
methods. Gene trapping entails inserting a member of a cDNA library into a vector. The vector
then is denatured to produce single stranded molecules. Next, a substrate-bound probe, such as a
biotinylated oligo, is used to trap cDNA inserts of interest. Biotinylated probes can be linked to an
avidin-bound solid substrate. PCR methods can be used to amplify the trapped cDNA. To trap
sequences corresponding to the full length genes, the labeled probe sequence is based on the
polynucleotide sequences of the invention. Random primers or primers specific to the library
vector can be used to amplify the trapped cDNA. Such gene trapping techniques are described in
Gruber et al., WO 95/04745 and Gruber et al., USPN 5,500,356. Kits are commercially available
to perform gene trapping experiments from, for example, Life Technologies, Gaithersburg,
Maryland, USA.

"Rapid amplification of cDNA ends," or RACE, is a PCR method of amplifying cDNAs
from a number of different RNAs. The cDNAs are ligated to an oligonucleotide linker, and
amplified by PCR using two primers. One primer is based on sequence from the instant
polynucleotides, for which full length sequence is desired, and a second primer comprises sequence
that hybridizes to the oligonucleotide linker to amplify the cDNA. A description of this method is
reported in WO 97/19110. In preferred embodiments of RACE, a common primer is designed to
anneal to an arbitrary adaptor sequence ligated to cDNA ends (Apte and Siebert, Biotechniques
(1993) 15:890-893; Edwards et al., Nuc. Acids Res. (1991) 19:5227-5232). When a single
gene-specific RACE primer is paired with the common primer, preferential amplification of sequences
between the single gene-specific primer and the common primer occurs. Commercial cDNA pools
modified for use in RACE are available.

Another PCR-based method generates a full-length cDNA library with anchored ends
without needing specific knowledge of the cDNA sequence. The method uses lock-docking
primers (I-VI), where one primer, polyTV (I-III), locks over the polyA tail of eukaryotic mRNA,
producing first strand synthesis, and a second primer, polyGH (IV-VI), locks onto the polyC tail
added by terminal deoxynucleotidyl transferase (TdT) (see, e.g., WO 96/40998).
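
The lock-docking primer sets can be pictured by expanding a degenerate anchor base into explicit sequences. The sketch below assumes that "polyTV" denotes a run of T followed by one base V (A, C, or G in IUPAC notation, i.e., not T) and that "polyGH" denotes a run of G followed by one base H (A, C, or T, i.e., not G), giving three primers each. That reading, the function name, and any homopolymer length used are assumptions rather than details taken from WO 96/40998.

```python
# IUPAC degenerate codes used here: V = A/C/G (not T), H = A/C/T (not G)
IUPAC = {"V": "ACG", "H": "ACT"}


def lock_docking_primers(homopolymer_base, length, anchor_code):
    """Expand a homopolymer run plus one degenerate 3' anchor base into
    the explicit primer sequences (hypothetical helper)."""
    return [homopolymer_base * length + base for base in IUPAC[anchor_code]]
```

Because the anchor base cannot pair within the homopolymer tail, each primer is forced to sit at the junction between the tail and the unique sequence, which is what "locks" it in place.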

The promoter region of a gene generally is located 5' to the initiation site for RNA
polymerase II. Hundreds of promoter regions contain the "TATA" box, a sequence such as
TATTA or TATAA, which is sensitive to mutations. The promoter region can be obtained by
performing 5' RACE using a primer from the coding region of the gene. Alternatively, the cDNA
can be used as a probe for the genomic sequence, and the region 5' to the coding region is identified
by "walking up." If the gene is highly expressed or differentially expressed, the promoter from the
gene can be of use in a regulatory construct for a heterologous gene.

Once the full-length cDNA or gene is obtained, DNA encoding variants can be prepared by
site-directed mutagenesis, described in detail in Sambrook et al., 15.3-15.63. The choice of codon
or nucleotide to be replaced can be based on disclosure herein on optional changes in amino acids
to achieve altered protein structure and/or function.

As an alternative method to obtaining DNA or RNA from a biological material, nucleic
acid comprising nucleotides having the sequence of one or more polynucleotides of the invention
can be synthesized. Thus, the invention encompasses nucleic acid molecules ranging in length
from 15 nt (corresponding to at least 15 contiguous nt of one of SEQ ID NOS:1-316) up to a
maximum length suitable for one or more biological manipulations, including replication and
expression, of the nucleic acid molecule. The invention includes but is not limited to (a) nucleic
acid having the size of a full gene, and comprising at least one of SEQ ID NOS:1-316; (b) the
nucleic acid of (a) also comprising at least one additional gene, operably linked to permit
expression of a fusion protein; (c) an expression vector comprising (a) or (b); (d) a plasmid
comprising (a) or (b); and (e) a recombinant viral particle comprising (a) or (b). Once provided
with the polynucleotides disclosed herein, construction or preparation of (a)-(e) is well within the
skill in the art.

The sequence of a nucleic acid comprising at least 15 contiguous nt of at least any one of
SEQ ID NOS:1-316, preferably the entire sequence of at least any one of SEQ ID NOS:1-316, is
not limited and can be any sequence of A, T, G, and/or C (for DNA) and A, U, G, and/or C (for
RNA) or modified bases thereof, including inosine and pseudouridine. The choice of sequence will
depend on the desired function and can be dictated by coding regions desired, the intron-like
regions desired, and the regulatory regions desired. Where the entire sequence of any one of SEQ
ID NOS:1-316 is within the nucleic acid, the nucleic acid obtained is referred to herein as a
polynucleotide comprising the sequence of any one of SEQ ID NOS:1-316.

Expression of Polypeptide Encoded by Full-Length cDNA or Full-Length Gene

The provided polynucleotides (e.g., a polynucleotide having a sequence of one of SEQ ID
NOS:1-316), the corresponding cDNA, or the full-length gene is used to express a partial or
complete gene product. Constructs of polynucleotides having sequences of SEQ ID NOS:1-316
can also be generated synthetically. Alternatively, single-step assembly of a gene and entire
plasmid from large numbers of oligodeoxyribonucleotides is described by, e.g., Stemmer et al.,
Gene (Amsterdam) (1995) 164(1):49-53. In this method, assembly PCR (the synthesis of long
DNA sequences from large numbers of oligodeoxyribonucleotides (oligos)) is described. The
method is derived from DNA shuffling (Stemmer, Nature (1994) 370:389-391), and does not rely
on DNA ligase, but instead relies on DNA polymerase to build increasingly longer DNA fragments
during the assembly process.

Appropriate polynucleotide constructs are purified using standard recombinant DNA
techniques as described in, for example, Sambrook et al., Molecular Cloning: A Laboratory
Manual, 2nd Ed., (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY, and under current
regulations described in United States Dept. of HHS, National Institutes of Health (NIH) Guidelines
for Recombinant DNA Research. The gene product encoded by a polynucleotide of the invention is
expressed in any expression system, including, for example, bacterial, yeast, insect, amphibian and
mammalian systems. Vectors, host cells and methods for obtaining expression in same are well
known in the art. Suitable vectors and host cells are described in USPN 5,654,173.

Polynucleotide molecules comprising a polynucleotide sequence provided herein are
generally propagated by placing the molecule in a vector. Viral and non-viral vectors are used,
including plasmids. The choice of plasmid will depend on the type of cell in which propagation is
desired and the purpose of propagation. Certain vectors are useful for amplifying and making large
amounts of the desired DNA sequence. Other vectors are suitable for expression in cells in culture.
Still other vectors are suitable for transfer and expression in cells in a whole animal or person. The
choice of appropriate vector is well within the skill of the art. Many such vectors are available
commercially. Methods for preparation of vectors comprising a desired sequence are well known
in the art.

The polynucleotides set forth in SEQ ID NOS:1-316 or their corresponding full-length
polynucleotides are linked to regulatory sequences as appropriate to obtain the desired expression
properties. These can include promoters (attached either at the 5' end of the sense strand or at the 3'
end of the antisense strand), enhancers, terminators, operators, repressors, and inducers. The
promoters can be regulated or constitutive. In some situations it may be desirable to use
conditionally active promoters, such as tissue-specific or developmental stage-specific promoters.
These are linked to the desired nucleotide sequence using the techniques described above for
linkage to vectors. Any techniques known in the art can be used.

When any of the above host cells, or other appropriate host cells or organisms, are used to
replicate and/or express the polynucleotides or nucleic acids of the invention, the resulting
replicated nucleic acid, RNA, expressed protein or polypeptide, is within the scope of the invention
as a product of the host cell or organism. The product is recovered by any appropriate means
known in the art.

Once the gene corresponding to a selected polynucleotide is identified, its expression can
be regulated in the cell to which the gene is native. For example, an endogenous gene of a cell can
be regulated by an exogenous regulatory sequence as disclosed in USPN 5,641,670.

Identification of Functional and Structural Motifs of Novel Genes

Screening Against Publicly Available Databases

Translations of the nucleotide sequence of the provided polynucleotides, cDNAs or full
genes can be aligned with individual known sequences. Similarity with individual sequences can
be used to determine the activity of the polypeptides encoded by the polynucleotides of the
invention. Also, sequences exhibiting similarity with more than one individual sequence can
exhibit activities that are characteristic of either or both individual sequences.

The full length sequences and fragments of the polynucleotide sequences of the nearest
neighbors can be used as probes and primers to identify and isolate the full length sequence
corresponding to provided polynucleotides. The nearest neighbors can indicate a tissue or cell type
to be used to construct a library for the full-length sequences corresponding to the provided
polynucleotides.

Typically, a selected polynucleotide is translated in all six frames to determine the best
alignment with the individual sequences. The sequences disclosed herein in the Sequence Listing
are in a 5' to 3' orientation and translation in three frames can be sufficient (with a few specific
exceptions as described in the Examples). These amino acid sequences are referred to, generally, as
query sequences, which will be aligned with the individual sequences. Databases with individual
sequences are described in "Computer Methods for Macromolecular Sequence Analysis," Methods
in Enzymology (1996) 266, Doolittle, Academic Press, Inc., a division of Harcourt Brace & Co.,
San Diego, California, USA. Databases include GenBank, EMBL, and DNA Database of Japan
(DDBJ).
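
The six-frame translation described above can be sketched as follows. This is a minimal illustration only, assuming the standard genetic code and an unambiguous DNA sequence; the function names are hypothetical and are not part of the disclosure.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate starting at position 0; codons with unknown bases become 'X'."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translations(seq):
    """Return the three forward (+1..+3) and three reverse (-1..-3) translations."""
    frames = {}
    rc = reverse_complement(seq)
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(seq[offset:])
        frames[f"-{offset + 1}"] = translate(rc[offset:])
    return frames
```

For sequences already in a 5' to 3' orientation, only the "+1" to "+3" entries would be needed as query sequences.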

Query and individual sequences can be aligned using the methods and computer programs
described above, and include BLAST 2.0, available over the world wide web at a site supported by
the National Center for Biotechnology Information, which is supported by the National Library of
Medicine and the National Institutes of Health. See also Altschul, et al. Nucleic Acids Res. (1997)
25:3389-3402. Another alignment algorithm is Fasta, available in the Genetics Computer Group
(GCG) package, Madison, Wisconsin, USA, a wholly owned subsidiary of Oxford Molecular
Group, Inc. Other techniques for alignment are described in Doolittle, supra. Preferably, an
alignment program that permits gaps in the sequence is utilized to align the sequences. The
Smith-Waterman algorithm is one type of algorithm that permits gaps in sequence alignments. See
Meth. Mol. Biol. (1997) 70:173-187. Also, the GAP program using the Needleman and Wunsch
alignment method can be utilized to align sequences. An alternative search strategy uses MPSRCH
software, which runs on a MASPAR computer. MPSRCH uses a Smith-Waterman algorithm to score
sequences on a massively parallel computer. This approach improves the ability to identify
sequences that are distantly related matches, and is especially tolerant of small gaps and
nucleotide sequence errors.
Amino acid sequences encoded by the provided polynucleotides can be used to search both protein
and DNA databases. Incorporated herein by reference are all sequences that have been made public
as of the filing date of this application by any of the DNA or protein sequence databases, including
the patent databases (e.g., GeneSeq). Also incorporated by reference are those sequences that have
been submitted to these databases as of the filing date of the present application but not made
public until after the filing date of the present application.
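
To illustrate how a gap-permitting local alignment is scored, the following is a minimal sketch of Smith-Waterman scoring. It assumes a simple match/mismatch scheme with a linear gap penalty; the scoring values are arbitrary illustrations and do not reflect the parameters of BLAST, GCG, or MPSRCH.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score of a and b (linear gap penalty)."""
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            # Score for aligning a[i-1] with b[j-1].
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A cell never drops below zero, which is what makes the alignment local.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best
```

With these illustrative values, aligning "ACGTT" against "ACTT" scores higher with one gap than without, which is the behavior the gap-permitting programs above rely on.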

Results of individual and query sequence alignments can be divided into three categories:
high similarity, weak similarity, and no similarity. Individual alignment results ranging from high
similarity to weak similarity provide a basis for determining polypeptide activity and/or structure.
Parameters for categorizing individual results include: percentage of the alignment region length
where the strongest alignment is found, percent sequence identity, and p value. The percentage of
the alignment region length is calculated by counting the number of residues of the individual
sequence found in the region of strongest alignment, e.g., the contiguous region of the individual
sequence that contains the greatest number of residues that are identical to the residues of the
corresponding region of the aligned query sequence. This number is divided by the total residue
length of the query sequence to calculate a percentage. For example, a query sequence of 20 amino
acid residues might be aligned with a 20 amino acid region of an individual sequence. The
individual sequence might be identical to amino acid residues 5, 9-15, and 17-19 of the query
sequence. The region of strongest alignment is thus the region stretching from residue 9-19, an 11
amino acid stretch. The percentage of the alignment region length is: 11 (length of the region of
strongest alignment) divided by (query sequence length) 20 or 55%.

Percent sequence identity is calculated by counting the number of amino acid matches
between the query and individual sequence and dividing the total number of matches by the number of
residues of the individual sequences found in the region of strongest alignment. Thus, the percent
identity in the example above would be 10 matches divided by 11 amino acids, or approximately
90.9%.
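
The arithmetic of the worked example can be reproduced with a short sketch. The function name is hypothetical, and the boundaries of the region of strongest alignment (residues 9-19) are supplied as inputs, since the text does not specify a procedure for locating them.

```python
def alignment_metrics(query_length, region_start, region_end, identical_positions):
    """Return (percent alignment region length, percent sequence identity).

    Positions are 1-based and inclusive, as in the worked example.
    """
    region_length = region_end - region_start + 1
    matches = sum(
        1 for p in identical_positions if region_start <= p <= region_end
    )
    region_pct = 100.0 * region_length / query_length
    identity_pct = 100.0 * matches / region_length
    return region_pct, identity_pct


# Identical residues 5, 9-15, and 17-19 of a 20-residue query.
identical = [5] + list(range(9, 16)) + list(range(17, 20))
region_pct, identity_pct = alignment_metrics(20, 9, 19, identical)
print(f"{region_pct:.1f}% region length, {identity_pct:.1f}% identity")
```

Note that residue 5 is identical but lies outside the region of strongest alignment, so it does not count toward the 10 matches.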

P value is the probability that the alignment was produced by chance. For a single
alignment, the p value can be calculated according to Karlin et al., Proc. Natl. Acad. Sci. (1990)
87:2264 and Karlin et al., Proc. Natl. Acad. Sci. (1993) 90. The p value of multiple alignments
using the same query sequence can be calculated using a heuristic approach described in Altschul
et al., Nat. Genet. (1994) 6:119. Alignment programs such as the BLAST program can calculate the p
value. See also Altschul et al., Nucleic Acids Res. (1997) 25:3389-3402.

Another factor to consider for determining identity or similarity is the location of the
similarity or identity. Strong local alignment can indicate similarity even if the length of alignment
is short. Sequence identity scattered throughout the length of the query sequence also can indicate
a similarity between the query and profile sequences. The boundaries of the region where the
sequences align can be determined according to Doolittle, supra; BLAST 2.0 (see, e.g., Altschul, et
al. Nucleic Acids Res. (1997) 25:3389-3402) or FAST programs; or by determining the area where
sequence identity is highest.

High Similarity. In general, in alignment results considered to be of high similarity, the
percent of the alignment region length is typically at least about 55% of the total length of the query
sequence; more typically, at least about 58%; even more typically, at least about 60% of the total
residue length of the query sequence. Usually, percent length of the alignment region can be as
much as about 62%; more usually, as much as about 64%; even more usually, as much as about
66%. Further, for high similarity, the region of alignment typically exhibits at least about 75%
sequence identity; more typically, at least about 78%; even more typically, at least about 80%
sequence identity. Usually, percent sequence identity can be as much as about 82%; more usually,
as much as about 84%; even more usually, as much as about 86%.

The p value is used in conjunction with these methods. If high similarity is found, the
query sequence is considered to have high similarity with a profile sequence when the p value is
less than or equal to about 10^-2; more usually, less than or equal to about 10^-3; even more usually,
less than or equal to about 10^-4. More typically, the p value is no more than about 10^-5; more
typically, no more than about 10^-10; even more typically, no more than about
10^-15 for the query sequence to be considered high similarity.
Weak Similarity. In general, where alignment results are considered to be of weak similarity,
there is no minimum percent length of the alignment region nor minimum length of alignment. A
better showing of weak similarity is considered when the region of alignment is, typically, at least
about 15 amino acid residues in length; more typically, at least about 20; even more typically, at
least about 25 amino acid residues in length. Usually, length of the alignment region can be as
much as about 30 amino acid residues; more usually, as much as about 40; even more usually, as
much as about 60 amino acid residues. Further, for weak similarity, the region of alignment
typically exhibits at least about 35% sequence identity; more typically, at least about 40%; even
more typically, at least about 45% sequence identity. Usually, percent sequence identity can be as
much as about 50%; more usually, as much as about 55%; even more usually, as much as about
60%.

If low similarity is found, the query sequence is considered to have weak similarity with a
profile sequence when the p value is usually less than or equal to about 10^-2; more usually, less than
or equal to about 10^-3; even more usually, less than or equal to about 10^-4. More typically, the p
value is no more than about 10^-5; more usually, no more than about 10^-10; even more
usually, no more than about 10^-15 for the query sequence to be considered weak
similarity.
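
The three categories can be summarized in a short sketch. It assumes, purely for illustration, that the least stringent "at least about" values given above (55% alignment region length and 75% identity for high similarity, 35% identity for weak similarity, and a p value of at most 10^-2 for both) are applied as hard cut-offs. The text itself states these as typical ranges rather than fixed rules, and the function name is hypothetical.

```python
def classify_alignment(region_pct, identity_pct, p_value):
    """Assign an alignment to one of the three similarity categories.

    Thresholds are the loosest values recited in the text, treated here
    as hard cut-offs (an assumption made for this sketch).
    """
    if p_value > 1e-2:
        return "no similarity"
    if region_pct >= 55 and identity_pct >= 75:
        return "high similarity"
    # Weak similarity has no minimum alignment region length.
    if identity_pct >= 35:
        return "weak similarity"
    return "no similarity"
```

The worked example above (55% region length, 90.9% identity) would fall in the high similarity category provided its p value were about 10^-2 or lower.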

Similarity Determined by Sequence Identity Alone. Sequence identity alone can be used to
determine similarity of a query sequence to an individual sequence and can indicate the activity of
the sequence. Such an alignment, preferably, permits gaps to align sequences. Typically, the query
sequence is related to the profile sequence if the sequence identity over the entire query sequence is
at least about 15%; more typically, at least about 20%; even more typically, at least about 25%;
even more typically, at least about 50%. Sequence identity alone as a measure of similarity is most
useful when the query sequence is usually at least 80 residues in length; more usually, 90 residues;
even more usually, at least 95 amino acid residues in length. More typically, similarity can be
concluded based on sequence identity alone when the query sequence is preferably 100 residues in
length; more preferably, 120 residues in length; even more preferably, 150 amino acid residues in
length.

Alignments with Profile and Multiple Aligned Sequences. Translations of the provided 
polynucleotides can be aligned with amino acid profiles that define either protein families or
common motifs. Also, translations of the provided polynucleotides can be aligned to multiple 
sequence alignments (MSA) comprising the polypeptide sequences of members of protein families 
or motifs. Similarity or identity with profile sequences or MSAs can be used to determine the 
activity of the gene products (e.g., polypeptides) encoded by the provided polynucleotides or 
corresponding cDNA or genes. For example, sequences that show an identity or similarity with a
chemokine profile or MSA can exhibit chemokine activities. 
Profiles can be designed manually by (1) creating an MSA, which is an alignment of the
amino acid sequence of members that belong to the family and (2) constructing a statistical
representation of the alignment. Such methods are described, for example, in Birney et al., Nucl.
Acids Res. (1996) 24(14):2730-2739. MSAs of some protein families and motifs are publicly
available. For example, the Genome Sequencing Center at the Washington University School of
Medicine provides a web site (Pfam) which includes MSAs of 547 different families and motifs.
These MSAs are described also in Sonnhammer et al., Proteins (1997) 28:405-420. Other sources
over the world wide web include the site supported by the European Molecular Biology
Laboratories in Heidelberg, Germany. A brief description of these MSAs is reported in Pascarella
et al., Prot. Eng. (1996) 9(3):249-251. Techniques for building profiles from MSAs are described
in Sonnhammer et al., supra; Birney et al., supra; and "Computer Methods for Macromolecular
Sequence Analysis," Methods in Enzymology (1996) 266, Doolittle, Academic Press, Inc., San
Diego, California, USA.

Similarity between a query sequence and a protein family or motif can be determined by (a)
comparing the query sequence against the profile and/or (b) aligning the query sequence with the
members of the family or motif. Typically, a program such as Searchwise is used to compare the
query sequence to the statistical representation of the multiple alignment, also known as a profile
(see Birney et al., supra). Other techniques to compare the sequence and profile are described in
Sonnhammer et al., supra and Doolittle, supra.

Next, methods described by Feng et al., J. Mol. Evol. (1987) 25:351 and Higgins et al.,
CABIOS (1989) 5:151 can be used to align the query sequence with the members of a family or motif,
also known as an MSA. Sequence alignments can be generated using any of a variety of software
tools. Examples include PileUp, which creates a multiple sequence alignment, and is described in
Feng et al., J. Mol. Evol. (1987) 25:351. Another method, GAP, uses the alignment method of
Needleman et al., J. Mol. Biol. (1970) 48:443. GAP is best suited for global alignment of
sequences. A third method, BestFit, functions by inserting gaps to maximize the number of
matches using the local homology algorithm of Smith et al., Adv. Appl. Math. (1981) 2:482. In
general, the following factors are used to determine if a similarity between a query sequence and a
profile or MSA exists: (1) number of conserved residues found in the query sequence, (2)
percentage of conserved residues found in the query sequence, (3) number of frameshifts, and (4)
spacing between conserved residues.

Some alignment programs that both translate and align sequences can make any number of
frameshifts when translating the nucleotide sequence to produce the best alignment. The fewer
frameshifts needed to produce an alignment, the stronger the similarity or identity between the
query and profile or MSAs. For example, a weak similarity resulting from no frameshifts can be a
better indication of activity or structure of a query sequence than a strong similarity resulting from
two frameshifts. Preferably, three or fewer frameshifts are found in an alignment; more preferably
two or fewer frameshifts; even more preferably, one or fewer frameshifts; even more preferably, no
frameshifts are found in an alignment of query and profile or MSAs.

Conserved residues are those amino acids found at a particular position in all or some of the 
family or motif members. Alternatively, a position is considered conserved if only a certain class
of amino acids is found in a particular position in all or some of the family members. For example, 
the N-terminal position can contain a positively charged amino acid, such as lysine, arginine, or 
histidine. 

Typically, a residue of a polypeptide is conserved when a class of amino acids or a single
amino acid is found at a particular position in at least about 40% of all class members; more
typically, at least about 50%; even more typically, at least about 60% of the members. Usually, a
residue is conserved when a class or single amino acid is found in at least about 70% of the
members of a family or motif; more usually, at least about 80%; even more usually, at least about
90%; even more usually, at least about 95%.
A residue is considered conserved when three unrelated amino acids are found at a
particular position in some or all of the members; more usually, two unrelated amino acids.
These residues are conserved when the unrelated amino acids are found at particular positions in at
least about 40% of all class members; more typically, at least about 50%; even more typically, at
least about 60% of the members. Usually, a residue is conserved when a class or single amino acid
is found in at least about 70% of the members of a family or motif; more usually, at least about
80%; even more usually, at least about 90%; even more usually, at least about 95%.

A query sequence has similarity to a profile or MSA when the query sequence comprises at
least about 25% of the conserved residues of the profile or MSA; more usually, at least about 30%;
even more usually, at least about 40%. Typically, the query sequence has a stronger similarity to a
profile sequence or MSA when the query sequence comprises at least about 45% of the conserved
residues of the profile or MSA; more typically, at least about 50%; even more typically, at least
about 55%.
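
The conserved-residue criteria can be illustrated with a short sketch. It makes several simplifying assumptions that are not part of the disclosure: the MSA is given as equal-length strings with "-" for gaps, only the single most common amino acid at each column is considered (amino acid classes and unrelated-residue combinations are ignored), and the query has already been aligned to the MSA columns. The function names are hypothetical.

```python
from collections import Counter


def conserved_positions(msa, min_fraction=0.40):
    """Map column index -> residue for columns whose most common residue
    occurs in at least min_fraction of the family members."""
    conserved = {}
    for col in range(len(msa[0])):
        column = [seq[col] for seq in msa]
        residue, count = Counter(column).most_common(1)[0]
        if residue != "-" and count / len(msa) >= min_fraction:
            conserved[col] = residue
    return conserved


def fraction_conserved_matched(aligned_query, conserved):
    """Fraction of conserved residues that the aligned query shares."""
    if not conserved:
        return 0.0
    hits = sum(1 for col, res in conserved.items() if aligned_query[col] == res)
    return hits / len(conserved)
```

Under the criteria above, a query sharing at least about 25% of the conserved residues would be considered similar to the profile or MSA, and one sharing at least about 45% would show stronger similarity.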

Identification of Secreted & Membrane-Bound Polypeptides 

Both secreted and membrane-bound polypeptides of the present invention are of particular
interest. For example, levels of secreted polypeptides can be assayed in body fluids that are
convenient, such as blood, plasma, serum, and other body fluids such as urine, prostatic fluid and
semen. Membrane-bound polypeptides are useful for constructing vaccine antigens or inducing an
immune response. Such antigens would comprise all or part of the extracellular region of the
membrane-bound polypeptides. Because both secreted and membrane-bound polypeptides
comprise a fragment of contiguous hydrophobic amino acids, hydrophobicity predicting algorithms
can be used to identify such polypeptides.
A signal sequence is usually encoded by both secreted and membrane-bound polypeptide
genes to direct a polypeptide to the surface of the cell. The signal sequence usually comprises a
stretch of hydrophobic residues. Such signal sequences can fold into helical structures.
Membrane-bound polypeptides typically comprise at least one transmembrane region that possesses a
stretch of hydrophobic amino acids that can traverse the membrane. Some transmembrane regions also
exhibit a helical structure. Hydrophobic fragments within a polypeptide can be identified by using
computer algorithms. Such algorithms include Hopp & Woods, Proc. Natl. Acad. Sci. USA (1981)
78:3824-3828; Kyte & Doolittle, J. Mol. Biol. (1982) 157:105-132; and RAOAR algorithm, Degli
Esposti et al., Eur. J. Biochem. (1990) 190:207-219.
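
As an illustration of a hydrophobicity predicting algorithm, the following sketch computes a sliding-window hydropathy profile using the commonly tabulated Kyte & Doolittle scale. The window size of 9 and the function name are assumptions made for this example; the text does not specify them.

```python
# Kyte & Doolittle hydropathy values, keyed by one-letter amino acid code.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window=9):
    """Return the mean hydropathy of each window along the sequence.

    Higher averages suggest hydrophobic stretches such as signal
    sequences or transmembrane regions.
    """
    scores = [KYTE_DOOLITTLE[aa] for aa in protein.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]
```

Peaks in the returned profile mark candidate hydrophobic fragments; the cut-off for calling a peak would need to be chosen for the application.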

Another method of identifying secreted and membrane-bound polypeptides is to translate
the polynucleotides of the invention in all six frames and determine if at least 8 contiguous
hydrophobic amino acids are present. Those translated polypeptides with at least 8; more typically,
10; even more typically, 12 contiguous hydrophobic amino acids are considered to be either a
putative secreted or membrane bound polypeptide. Hydrophobic amino acids include alanine,
glycine, histidine, isoleucine, leucine, lysine, methionine, phenylalanine, proline, threonine,
tryptophan, tyrosine, and valine.
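
The contiguous-run test can be sketched as follows, using exactly the set of amino acids listed above as hydrophobic and the minimum run of 8. The function names are hypothetical, and the input is assumed to be a single translated frame in one-letter code.

```python
# One-letter codes for the amino acids listed above as hydrophobic:
# Ala, Gly, His, Ile, Leu, Lys, Met, Phe, Pro, Thr, Trp, Tyr, Val.
HYDROPHOBIC = set("AGHILKMFPTWYV")


def longest_hydrophobic_run(protein):
    """Return the length of the longest run of contiguous hydrophobic residues."""
    best = current = 0
    for aa in protein.upper():
        current = current + 1 if aa in HYDROPHOBIC else 0
        best = max(best, current)
    return best


def is_putative_secreted_or_membrane(protein, min_run=8):
    """Apply the minimum-run criterion (8, or more stringently 10 or 12)."""
    return longest_hydrophobic_run(protein) >= min_run
```

In practice the check would be applied to each of the six translated frames of a polynucleotide.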

Identification of the Function of an Expression Product of a Full-Length Gene 
Ribozymes, antisense constructs, and dominant negative mutants can be used to determine
function of the expression product of a gene corresponding to a polynucleotide provided herein.
These methods and compositions are particularly useful where the provided novel polynucleotide
exhibits no significant or substantial homology to a sequence encoding a gene of known function.
Antisense molecules and ribozymes can be constructed from synthetic polynucleotides. Typically,
the phosphoramidite method of oligonucleotide synthesis is used. See Beaucage et al., Tet. Lett.
(1981) 22:1859 and USPN 4,668,777. Automated devices for synthesis are available to create
oligonucleotides using this chemistry. Examples of such devices include Biosearch 8600, Models
392 and 394 by Applied Biosystems, a division of Perkin-Elmer Corp., Foster City, California,
USA; and Expedite by Perceptive Biosystems, Framingham, Massachusetts, USA. Synthetic RNA,
phosphate analog oligonucleotides, and chemically derivatized oligonucleotides can also be
produced, and can be covalently attached to other molecules. RNA oligonucleotides can be
synthesized, for example, using RNA phosphoramidites. This method can be performed on an
automated synthesizer, such as Applied Biosystems, Models 392 and 394, Foster City, California,
USA.

Phosphorothioate oligonucleotides can also be synthesized for antisense construction. A
sulfurizing reagent, such as tetraethylthiuram disulfide (TETD) in acetonitrile, can be used to
convert the internucleotide cyanoethyl phosphite to the phosphorothioate triester within 15 minutes
at room temperature. TETD replaces the iodine reagent, while all other reagents used for standard
phosphoramidite chemistry remain the same. Such a synthesis method can be automated using
Models 392 and 394 by Applied Biosystems, for example.

Oligonucleotides of up to 200 nt can be synthesized, more typically, 100 nt, more typically
50 nt; even more typically 30 to 40 nt. These synthetic fragments can be annealed and ligated
together to construct larger fragments. See, for example, Sambrook et al., supra. Trans-cleaving
catalytic RNAs (ribozymes) are RNA molecules possessing endoribonuclease activity. Ribozymes
are specifically designed for a particular target, and the target message must contain a specific
nucleotide sequence. They are engineered to cleave any RNA species site-specifically in the
background of cellular RNA. The cleavage event renders the mRNA unstable and prevents protein
expression. Importantly, ribozymes can be used to inhibit expression of a gene of unknown
function for the purpose of determining its function in an in vitro or in vivo context, by detecting
the phenotypic effect. One commonly used ribozyme motif is the hammerhead, for which the
substrate sequence requirements are minimal. Design of the hammerhead ribozyme, as well as
therapeutic uses of ribozymes, are disclosed in Usman et al., Current Opin. Struct. Biol. (1996)
6:521. Methods for production of ribozymes, including hairpin structure ribozyme fragments,
methods of increasing ribozyme specificity, and the like are known in the art.

The hybridizing region of the ribozyme can be modified or can be prepared as a branched
structure as described in Horn and Urdea, Nucleic Acids Res. (1989) 17:6959. The basic structure
of the ribozymes can also be chemically altered in ways familiar to those skilled in the art, and
chemically synthesized ribozymes can be administered as synthetic oligonucleotide derivatives
modified by monomeric units. In a therapeutic context, liposome mediated delivery of ribozymes
improves cellular uptake, as described in Birikh et al., Eur. J. Biochem. (1997) 245:1.

Antisense nucleic acids are designed to specifically bind to RNA, resulting in the formation
of RNA-DNA or RNA-RNA hybrids, with an arrest of DNA replication, reverse transcription or
messenger RNA translation. Antisense polynucleotides based on a selected polynucleotide
sequence can interfere with expression of the corresponding gene. Antisense polynucleotides are
typically generated within the cell by expression from antisense constructs that contain the
antisense strand as the transcribed strand. Antisense polynucleotides based on the disclosed
polynucleotides will bind and/or interfere with the translation of mRNA comprising a sequence
complementary to the antisense polynucleotide. The expression products of control cells and cells
treated with the antisense construct are compared to detect the protein product of the gene
corresponding to the polynucleotide upon which the antisense construct is based. The protein is
isolated and identified using routine biochemical methods.
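
The complementarity relationship between an antisense polynucleotide and its target mRNA can be shown with a brief sketch. It assumes a DNA antisense oligonucleotide written 5' to 3' and an unambiguous target sequence; the function name is hypothetical, and no design considerations (length, target accessibility, chemistry) are modeled.

```python
def antisense_oligo(target_region):
    """Return the DNA antisense sequence (5'->3') complementary to a target
    mRNA region. Both U (RNA) and T (cDNA-style input) pair with A."""
    pairs = {"A": "T", "U": "A", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(target_region.upper()))
```

For example, the target region AUGGCC corresponds to the antisense sequence GGCCAT.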

Given the extensive background literature and clinical experience in antisense therapy, one
skilled in the art can use selected polynucleotides of the invention as additional potential
therapeutics. The choice of polynucleotide can be narrowed by first testing them for binding to
"hot spot" regions of the genome of cancerous cells. If a polynucleotide is identified as binding to a
"hot spot", testing the polynucleotide as an antisense compound in the corresponding cancer cells is
warranted.

As an alternative method for identifying function of the gene corresponding to a
polynucleotide disclosed herein, dominant negative mutations are readily generated for
corresponding proteins that are active as homomultimers. A mutant polypeptide will interact with
wild-type polypeptides (made from the other allele) and form a non-functional multimer. Thus, a
mutation is in a substrate-binding domain, a catalytic domain, or a cellular localization domain.
Preferably, the mutant polypeptide will be overproduced. Point mutations are made that have such
an effect. In addition, fusion of different polypeptides of various lengths to the terminus of a
protein can yield dominant negative mutants. General strategies are available for making dominant
negative mutants (see, e.g., Herskowitz, Nature (1987) 329:219). Such techniques can be used to
create loss of function mutations, which are useful for determining protein function.
Polypeptides and Variants Thereof 

The polypeptides of the invention include those encoded by the disclosed polynucleotides,
as well as nucleic acids that, by virtue of the degeneracy of the genetic code, are not identical in
sequence to the disclosed polynucleotides. Thus, the invention includes within its scope a
polypeptide encoded by a polynucleotide having the sequence of any one of SEQ ID NOS:1-316 or
a variant thereof.

In general, the term "polypeptide" as used herein refers to both the full length polypeptide
encoded by the recited polynucleotide, the polypeptide encoded by the gene represented by the
recited polynucleotide, as well as portions or fragments thereof. "Polypeptides" also includes
variants of the naturally occurring proteins, where such variants are homologous or substantially
similar to the naturally occurring protein, and can be of an origin of the same or different species as
the naturally occurring protein (e.g., human, murine, or some other species that naturally expresses
the recited polypeptide, usually a mammalian species). In general, variant polypeptides have a
sequence that has at least about 80%, usually at least about 90%, and more usually at least about
98% sequence identity with a differentially expressed polypeptide of the invention, as measured by
BLAST 2.0 using the parameters described above. The variant polypeptides can be naturally or
non-naturally glycosylated, i.e., the polypeptide has a glycosylation pattern that differs from the
glycosylation pattern found in the corresponding naturally occurring protein.

The invention also encompasses homologs of the disclosed polypeptides (or fragments
thereof) where the homologs are isolated from other species, i.e., other animal or plant species,
where such species are usually mammalian species, e.g., rodents, such as mice and rats; domestic
animals, e.g., horse, cow, dog, cat; and humans. By "homolog" is meant a polypeptide having at
least about 35%, usually at least about 40% and more usually at least about 60% amino acid
sequence identity to a particular differentially expressed protein as identified above, where
sequence identity is determined using the BLAST 2.0 algorithm, with the parameters described
supra.

In general, the polypeptides of the subject invention are provided in a non-naturally
occurring environment, e.g., are separated from their naturally occurring environment. In certain
embodiments, the subject protein is present in a composition that is enriched for the protein as
compared to a control. As such, purified polypeptide is provided, where by purified is meant that
the protein is present in a composition that is substantially free of non-differentially expressed
polypeptides, where by substantially free is meant that less than 90%, usually less than 60% and
more usually less than 50% of the composition is made up of non-differentially expressed
polypeptides.

Also within the scope of the invention are variants; variants of polypeptides include
mutants, fragments, and fusions. Mutants can include amino acid substitutions, additions or
deletions. The amino acid substitutions can be conservative amino acid substitutions or
substitutions to eliminate non-essential amino acids, such as to alter a glycosylation site, a
phosphorylation site or an acetylation site, or to minimize misfolding by substitution or deletion of
one or more cysteine residues that are not necessary for function. Conservative amino acid
substitutions are those that preserve the general charge, hydrophobicity/hydrophilicity, and/or
steric bulk of the amino acid substituted. Variants can be designed so as to retain or have enhanced
biological activity of a particular region of the protein (e.g., a functional domain and/or, where the
polypeptide is a member of a protein family, a region associated with a consensus sequence).
Selection of amino acid alterations for production of variants can be based upon the accessibility
(interior vs. exterior) of the amino acid (see, e.g., Go et al., Int. J. Peptide Protein Res. (1980)
15:211), the thermostability of the variant polypeptide (see, e.g., Querol et al., Prot. Eng. (1996)
9:265), desired glycosylation sites (see, e.g., Olsen and Thomsen, J. Gen. Microbiol. (1991)
137:579), desired disulfide bridges (see, e.g., Clarke et al., Biochemistry (1993) 32:4322; and
Wakarchuk et al., Protein Eng. (1994) 7:1379), desired metal binding sites (see, e.g., Toma et al.,
Biochemistry (1991) 30:97, and Haezerbrouck et al., Protein Eng. (1993) 6:643), and desired
substitutions within proline loops (see, e.g., Masul et al., Appl. Env. Microbiol. (1994) 60:3519).
Cysteine-depleted muteins can be produced as disclosed in USPN 4,959,314.

Variants also include fragments of the polypeptides disclosed herein, particularly
biologically active fragments and/or fragments corresponding to functional domains. Fragments of
interest will typically be at least about 10 aa to at least about 15 aa in length, usually at least about
50 aa in length, and can be as long as 300 aa in length or longer, but will usually not exceed about
1000 aa in length, where the fragment will have a stretch of amino acids that is identical to a
polypeptide encoded by a polynucleotide having a sequence of any of SEQ ID NOS:1-316, or a
homolog thereof. The protein variants described herein are encoded by polynucleotides that are
within the scope of the invention. The genetic code can be used to select the appropriate codons to
construct the corresponding variants.
Computer-Related Embodiments 
In general, a library of polynucleotides is a collection of sequence information, which
information is provided in either biochemical form (e.g., as a collection of polynucleotide
molecules), or in electronic form (e.g., as a collection of polynucleotide sequences stored in a
computer-readable form, as in a computer system and/or as part of a computer program). The
sequence information of the polynucleotides can be used in a variety of ways, e.g., as a resource for
gene discovery, as a representation of sequences expressed in a selected cell type (e.g., cell type
markers), and/or as markers of a given disease or disease state. In general, a disease marker is a
representation of a gene product that is present in all cells affected by disease either at an increased
or decreased level relative to a normal cell (e.g., a cell of the same or similar type that is not
substantially affected by disease). For example, a polynucleotide sequence in a library can be a
polynucleotide that represents an mRNA, polypeptide, or other gene product encoded by the
polynucleotide, that is either overexpressed or underexpressed in a breast ductal cell affected by
cancer relative to a normal (i.e., substantially disease-free) breast cell.

The nucleotide sequence information of the library can be embodied in any suitable form, 
e.g., electronic or biochemical forms. For example, a library of sequence information embodied in 

20 electronic form comprises an accessible computer data file (or, in biochemical form, a collection of 
nucleic acid molecules) that contains the representative nucleotide sequences of genes that are 
differentially expressed (e.g., overexpressed or underexpressed) as between, for example, i) a 
cancerous cell and a normal cell; ii) a cancerous cell and a dysplastic cell; iii) a cancerous cell and a 
cell affected by a disease or condition other than cancer; iv) a metastatic cancerous cell and a 

25 normal cell and/or non-metastatic cancerous cell; v) a malignant cancerous cell and a non- 
malignant cancerous cell (or a normal cell) and/or vi) a dysplastic cell relative to a normal cell. 
Other combinations and comparisons of cells affected by various diseases or stages of disease will 
be readily apparent to the ordinarily skilled artisan. Biochemical embodiments of the library 
include a collection of nucleic acids that have the sequences of the genes in the library, where the
nucleic acids can correspond to the entire gene in the library or to a fragment thereof, as described
in greater detail below. 

The polynucleotide libraries of the subject invention generally comprise sequence 
information of a plurality of polynucleotide sequences, where at least one of the polynucleotides 
has a sequence of any of SEQ ID NOS:1-316. By plurality is meant at least 2, usually at least 3,
and can include up to all of SEQ ID NOS:1-316. The length and number of polynucleotides in the
library will vary with the nature of the library, e.g., if the library is an oligonucleotide array, a
cDNA array, a computer database of the sequence information, etc. 

Where the library is an electronic library, the nucleic acid sequence information can be 
present in a variety of media. "Media" refers to a manufacture, other than an isolated nucleic acid 
molecule, that contains the sequence information of the present invention. Such a manufacture
provides the genome sequence or a subset thereof in a form that can be examined by means not 
directly applicable to the sequence as it exists in a nucleic acid. For example, the nucleotide 
sequence of the present invention, e.g. the nucleic acid sequences of any of the polynucleotides of 
SEQ ID NOS:1-316, can be recorded on computer readable media, e.g. any medium that can be
read and accessed directly by a computer. Such media include, but are not limited to: magnetic
storage media, such as a floppy disc, a hard disc storage medium, and a magnetic tape; optical 
storage media such as CD-ROM; electrical storage media such as RAM and ROM; and hybrids of 
these categories such as magnetic/optical storage media. One of skill in the art can readily 
appreciate how any of the presently known computer readable media can be used to create a
manufacture comprising a recording of the present sequence information. "Recorded" refers to a
process for storing information on a computer readable medium, using any such methods as known in
the art. Any convenient data storage structure can be chosen, based on the means used to access the 
stored information. A variety of data processor programs and formats can be used for storage, e.g. 
word processing text file, database format, etc. In addition to the sequence information, electronic
versions of the libraries of the invention can be provided in conjunction or connection with other
computer-readable information and/or other types of computer-readable files (e.g., searchable files,
executable files, etc., including, but not limited to, search program software).
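
As one illustration of recording the sequence information in computer readable form, the following sketch writes a small library to FASTA-formatted text and reads it back. The choice of FASTA, the function names, and the record names and sequences are assumptions made for illustration only; the sequences are placeholders and are not taken from the Sequence Listing.

```python
import io

def write_fasta(records, handle, width=60):
    """Write (name, sequence) pairs to a text handle in FASTA format."""
    for name, seq in records:
        handle.write(f">{name}\n")
        for i in range(0, len(seq), width):
            handle.write(seq[i:i + width] + "\n")

def read_fasta(handle):
    """Parse FASTA text back into a dict mapping name -> sequence."""
    library, name = {}, None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            library[name] = ""
        elif name is not None:
            library[name] += line
    return library

# Placeholder records (hypothetical names and sequences).
records = [("example_1", "ATGGCCATTGTAATGGGCCGC"), ("example_2", "GGGTTTAAACCC")]
buffer = io.StringIO()  # stands in for a file on any storage medium
write_fasta(records, buffer, width=10)
buffer.seek(0)
library = read_fasta(buffer)
```

Any other storage structure (e.g., a database table) could be substituted, provided the access means can retrieve each sequence by its identifier.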

By providing the nucleotide sequence in computer readable form, the information can be 
accessed for a variety of purposes. Computer software to access sequence information is publicly
available. For example, the gapped BLAST (Altschul et al. Nucleic Acids Res. (1997) 25:3389-
3402) and BLAZE (Brutlag et al. Comp. Chem. (1993) 17:203) search algorithms on a Sybase 
system can be used to identify open reading frames (ORFs) within the genome that contain 
homology to ORFs from other organisms. 
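
By way of a simplified example of locating open reading frames in stored sequence information, the sketch below scans the three forward reading frames for an ATG codon followed in frame by a stop codon. It is a stand-in for illustration and is not the BLAST or BLAZE algorithm; the function name, the minimum-length parameter, and the restriction to the forward strand are assumptions.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_orfs(seq, min_codons=2):
    """Return sorted (start, end) pairs of forward-strand ORFs.

    An ORF here runs from an ATG through the first in-frame stop codon;
    `end` is exclusive and includes the stop codon.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                # Count the codons before the stop, including the ATG.
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None
    return sorted(orfs)

orfs = find_orfs("CCATGAAATTTTAGGG")
```

Candidate ORFs found in this manner could then be submitted to a homology search program for comparison with ORFs from other organisms.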

As used herein, "a computer-based system" refers to the hardware means, software means,
and data storage means used to analyze the nucleotide sequence information of the present
invention. The minimum hardware of the computer-based systems of the present invention 
comprises a central processing unit (CPU), input means, output means, and data storage means. A 
skilled artisan can readily appreciate that any one of the currently available computer-based systems
is suitable for use in the present invention. The data storage means can comprise any manufacture
comprising a recording of the present sequence information as described above, or a memory
access means that can access such a manufacture.

"Search means" refers to one or more programs implemented on the computer-based
system, to compare a target sequence or target structural motif, or expression levels of a 
polynucleotide in a sample, with the stored sequence information. Search means can be used to 
identify fragments or regions of the genome that match a particular target sequence or target motif. 
A variety of algorithms are publicly known and commercially available, e.g. MacPattern
(EMBL), BLASTN and BLASTX (NCBI). A "target sequence" can be any polynucleotide or
amino acid sequence of six or more contiguous nucleotides or two or more amino acids, preferably
from about 10 to 100 amino acids or from about 30 to 300 nt. A variety of comparing means can be
used to accomplish comparison of sequence information from a sample (e.g., to analyze target
sequences, target motifs, or relative expression levels) with the data storage means. A skilled
artisan can readily recognize that any one of the publicly available homology search programs can
be used as the search means for the computer based systems of the present invention to accomplish 
comparison of target sequences and motifs. Computer programs to analyze expression levels in a 
sample and in controls are also known in the art. 
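
A minimal sketch of a search means is given below, assuming the stored sequence information is held as a mapping of identifiers to nucleotide sequences. It reports exact matches of a target sequence on either strand and enforces the six-nucleotide minimum noted above. The function names and example sequences are hypothetical, and exact matching is a simplification of what homology search programs such as BLASTN perform.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(comp[base] for base in reversed(seq.upper()))

def search_library(target, library):
    """Return (name, strand, position) for each exact occurrence of target.

    A palindromic target is reported once per strand at the same position.
    """
    target = target.upper()
    if len(target) < 6:
        raise ValueError("target sequence must be at least six nucleotides")
    hits = []
    for name, seq in library.items():
        seq = seq.upper()
        for strand, query in (("+", target), ("-", reverse_complement(target))):
            pos = seq.find(query)
            while pos != -1:
                hits.append((name, strand, pos))
                pos = seq.find(query, pos + 1)
    return hits

# Placeholder library (hypothetical names and sequences).
library = {"a": "AACCCAAATT", "b": "CCCTTTGGGAAA"}
hits = search_library("TTTGGG", library)
```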

A "target structural motif," or "target motif," refers to any rationally selected sequence or
combination of sequences in which the sequence(s) are chosen based on a three-dimensional
configuration that is formed upon the folding of the target motif, or on consensus sequences of 
regulatory or active sites. There are a variety of target motifs known in the art. Protein target 
motifs include, but are not limited to, enzyme active sites and signal sequences. Nucleic acid target
motifs include, but are not limited to, hairpin structures, promoter sequences and other expression
elements such as binding sites for transcription factors. 
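
For instance, a search for a target motif defined by a consensus sequence can be sketched as a regular-expression scan. The example below assumes the consensus is written with a subset of the IUPAC nucleotide codes and uses the TATA-box consensus TATAWAW purely as an illustration; the function name and test sequence are hypothetical.

```python
import re

# Subset of IUPAC nucleotide codes mapped to regular-expression classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "W": "[AT]", "S": "[CG]", "R": "[AG]", "Y": "[CT]", "N": "[ACGT]",
}

def motif_positions(consensus, seq):
    """Return the start index of every (possibly overlapping) motif match."""
    pattern = "".join(IUPAC[code] for code in consensus.upper())
    # A zero-width lookahead lets overlapping occurrences be reported.
    return [m.start() for m in re.finditer(f"(?={pattern})", seq.upper())]

positions = motif_positions("TATAWAW", "GGTATAAATGG")
```

Motifs defined by three-dimensional structure rather than by a consensus sequence would require a different comparison means.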

A variety of structural formats for the input and output means can be used to input and 
output the information in the computer-based systems of the present invention. One format for an 
output means ranks the relative expression levels of different polynucleotides. Such presentation
provides a skilled artisan with a ranking of relative expression levels to determine a gene
expression profile. 

As discussed above, the "library" of the invention also encompasses biochemical libraries 
of the polynucleotides of SEQ ID NOS:1-316, e.g., collections of nucleic acids representing the
provided polynucleotides. The biochemical libraries can take a variety of forms, e.g., a solution of
cDNAs, a pattern of probe nucleic acids stably associated with a surface of a solid support (i.e., an
array) and the like. Of particular interest are nucleic acid arrays in which one or more of SEQ ID
NOS:1-316 is represented on the array. By array is meant an article of manufacture that has at
least a substrate with at least two distinct nucleic acid targets on one of its surfaces, where the
number of distinct nucleic acids can be considerably higher, typically being at least 10 nt, usually at
least 20 nt and often at least 25 nt. A variety of different array formats have been developed and
are known to those of skill in the art. The arrays of the subject invention find use in a variety of
applications, including gene expression analysis, drug screening, mutation analysis and the like, as
disclosed in the above-listed exemplary patent documents. 

In addition to the above nucleic acid libraries, analogous libraries of polypeptides are also 
provided, where the polypeptides of the library will represent at least a portion of the
polypeptides encoded by SEQ ID NOS:1-316.

Utilities

Use of Polynucleotide Probes in Mapping, and in Tissue Profiling 

Polynucleotide probes, generally comprising at least 12 contiguous nt of a polynucleotide 
as shown in the Sequence Listing, are used for a variety of purposes, such as chromosome mapping 
of the polynucleotide and detection of transcription levels. Additional disclosure about preferred
regions of the disclosed polynucleotide sequences is found in the Examples. A probe that 
hybridizes specifically to a polynucleotide disclosed herein should provide a detection signal at 
least 5-, 10-, or 20-fold higher than the background hybridization provided with other unrelated 
sequences. 
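
The specificity criterion above can be expressed as a simple ratio test. In the sketch below, the function names and the signal values are hypothetical; the thresholds of 5-, 10-, and 20-fold are those stated in the text.

```python
def fold_over_background(signal, background):
    """Return the ratio of probe signal to background hybridization signal."""
    if background <= 0:
        raise ValueError("background signal must be positive")
    return signal / background

def is_specific(signal, background, min_fold=5.0):
    """True if the signal is at least `min_fold` times the background."""
    return fold_over_background(signal, background) >= min_fold

# Hypothetical measurements in arbitrary units.
meets_10_fold = is_specific(1200.0, 100.0, min_fold=10.0)
meets_5_fold = is_specific(400.0, 100.0)
```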

Detection of Expression Levels. Nucleotide probes are used to detect expression of a gene
corresponding to the provided polynucleotide. In Northern blots, mRNA is separated
electrophoretically and contacted with a probe. A probe is detected as hybridizing to an mRNA 
species of a particular size. The amount of hybridization is quantitated to determine relative 
amounts of expression, for example under a particular condition. Probes are used for in situ
hybridization to cells to detect expression. Probes can also be used in vivo for diagnostic detection
of hybridizing sequences. Probes are typically labeled with a radioactive isotope. Other types of 
detectable labels can be used such as chromophores, fluors, and enzymes. Other examples of 
nucleotide hybridization assays are described in WO92/02526 and USPN 5,124,246. 

Alternatively, the Polymerase Chain Reaction (PCR) is another means for detecting small
amounts of target nucleic acids (see, e.g., Mullis et al., Meth. Enzymol. (1987) 155:335; USPN
4,683,195; and USPN 4,683,202). Two primer polynucleotides that hybridize with the
target nucleic acids are used to prime the reaction. The primers can be composed of sequence
within or 3' and 5' to the polynucleotides of the Sequence Listing. Alternatively, if the primers are 3'
and 5' to these polynucleotides, they need not hybridize to them or the complements. After
amplification of the target with a thermostable polymerase, the amplified target nucleic acids can
be detected by methods known in the art, e.g., Southern blot. mRNA or cDNA can also be detected 
by traditional blotting techniques (e.g., Southern blot, Northern blot, etc.) described in Sambrook et 
al., "Molecular Cloning: A Laboratory Manual" (New York, Cold Spring Harbor Laboratory, 
1989) (e.g., without PCR amplification). In general, mRNA or cDNA generated from mRNA using
a polymerase enzyme can be purified and separated using gel electrophoresis, and transferred to a
solid support, such as nitrocellulose. The solid support is exposed to a labeled probe, washed to
remove any unhybridized probe, and duplexes containing the labeled probe are detected. 
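
As a simplified illustration of selecting the two primers used in PCR, the sketch below takes the forward primer from the 5' end of a template and the reverse primer as the reverse complement of its 3' end. It is an assumption-laden sketch: practical primer design also weighs melting temperature, GC content, and secondary structure, none of which is modeled here, and the function names and template are hypothetical.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[base] for base in reversed(seq.upper()))

def flanking_primers(template, length=20):
    """Return (forward, reverse) primers taken from the ends of the template."""
    if len(template) < 2 * length:
        raise ValueError("template is too short for two non-overlapping primers")
    forward = template[:length].upper()
    reverse = reverse_complement(template[-length:])
    return forward, reverse

# Placeholder template; a short primer length keeps the example readable.
forward, reverse = flanking_primers("ATGCCGTAAGGA", length=4)
```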

Mapping. Polynucleotides of the present invention can be used to identify a chromosome 
on which the corresponding gene resides. Such mapping can be useful in identifying the function of 
the polynucleotide-related gene by its proximity to other genes with known function. Function can
also be assigned to the polynucleotide-related gene when particular syndromes or diseases map to 
the same chromosome. For example, use of polynucleotide probes in identification and 
quantification of nucleic acid sequence aberrations is described in USPN 5,783,387. An exemplary 
mapping method is fluorescence in situ hybridization (FISH), which facilitates comparative
genomic hybridization to allow total genome assessment of changes in relative copy number of
DNA sequences (see, e.g., Valdes et al., Methods in Molecular Biology (1997) 68:1).
Polynucleotides can also be mapped to particular chromosomes using, for example, radiation
hybrids or chromosome-specific hybrid panels. See Leach et al., Advances in Genetics (1995)
33:63-99; Walter et al., Nature Genetics (1994) 7:22; Walter and Goodfellow, Trends in Genetics
(1992) 9:352. Panels for radiation hybrid mapping are available from Research Genetics, Inc.,
Huntsville, Alabama, USA. Databases for markers using various panels are available via the world
wide web at sites supported by the Stanford Human Genome Center (Stanford University) and the 
Whitehead Institute for Biomedical Research/MIT Center for Genome Research. The statistical 
program RHMAP can be used to construct a map based on the data from radiation hybridization
with a measure of the relative likelihood of one order versus another. RHMAP is available via the
world wide web at a site supported by the Center for Statistical Genetics at the University of 
Michigan School of Public Health. In addition, commercial programs are available for identifying 
regions of chromosomes commonly associated with disease, such as cancer. 

Tissue Typing or Profiling. Expression of specific mRNA corresponding to the provided
polynucleotides can vary in different cell types and can be tissue-specific. This variation of mRNA
levels in different cell types can be exploited with nucleic acid probe assays to determine tissue 
types. For example, PCR, branched DNA probe assays, or blotting techniques utilizing nucleic acid 
probes substantially identical or complementary to polynucleotides listed in the Sequence Listing 
can determine the presence or absence of the corresponding cDNA or mRNA. 

Tissue typing can be used to identify the developmental organ or tissue source of a
metastatic lesion by identifying the expression of a particular marker of that organ or tissue. If a
polynucleotide is expressed only in a specific tissue type, and a metastatic lesion is found to express 
that polynucleotide, then the developmental source of the lesion has been identified. Expression of 
a particular polynucleotide can be assayed by detection of either the corresponding mRNA or the
protein product. As would be readily apparent to any forensic scientist, the sequences disclosed
herein are useful in differentiating human tissue from non-human tissue. In particular, these
sequences are useful to differentiate human tissue from bird, reptile, and amphibian tissue, for
example. 

Use of Polymorphisms. A polynucleotide of the invention can be used in forensics, genetic 
analysis, mapping, and diagnostic applications where the corresponding region of a gene is 
polymorphic in the human population. Any means for detecting a polymorphism in a gene can be
used, including, but not limited to electrophoresis of protein polymorphic variants, differential 
sensitivity to restriction enzyme cleavage, and hybridization to allele-specific probes. 

Antibody Production 

Expression products of a polynucleotide of the invention, as well as the corresponding
mRNA, cDNA, or complete gene, can be prepared and used for raising antibodies for experimental,
diagnostic, and therapeutic purposes. For polynucleotides to which a corresponding gene has not
been assigned, this provides an additional method of identifying the corresponding gene. The 
polynucleotide or related cDNA is expressed as described above, and antibodies are prepared. 
These antibodies are specific to an epitope on the polypeptide encoded by the polynucleotide, and
can precipitate or bind to the corresponding native protein in a cell or tissue preparation or in a
cell-free extract of an in vitro expression system.

Methods for production of antibodies that specifically bind a selected antigen are well 
known in the art. Immunogens for raising antibodies can be prepared by mixing a polypeptide 
encoded by a polynucleotide of the invention with an adjuvant, and/or by making fusion proteins
with larger immunogenic proteins. Polypeptides can also be covalently linked to other larger
immunogenic proteins, such as keyhole limpet hemocyanin. Immunogens are typically
administered intradermally, subcutaneously, or intramuscularly to experimental animals such as
rabbits, sheep, and mice, to generate antibodies. Monoclonal antibodies can be generated by
isolating spleen cells and fusing them with myeloma cells to form hybridomas.
Alternatively, the selected polynucleotide is administered directly, such as by intramuscular
injection, and expressed in vivo. The expressed protein generates a variety of protein-specific 
immune responses, including production of antibodies, comparable to administration of the protein. 

Preparations of polyclonal and monoclonal antibodies specific for polypeptides encoded by 
a selected polynucleotide are made using standard methods known in the art. The antibodies
specifically bind to epitopes present in the polypeptides encoded by polynucleotides disclosed in
the Sequence Listing. Typically, at least 6, 8, 10, or 12 contiguous amino acids are required to
form an epitope. Epitopes that involve non-contiguous amino acids may require a longer
polypeptide, e.g., at least 15, 25, or 50 amino acids. Antibodies that specifically bind to human
polypeptides encoded by the provided polynucleotides should provide a detection signal at least 5-,
10-, or 20-fold higher than a detection signal provided with other proteins when used in Western blots
or other immunochemical assays. Preferably, antibodies that specifically bind polypeptides of the
invention do not bind to other proteins in immunochemical assays at detectable levels and can
immunoprecipitate the specific polypeptide from solution. 

The invention also contemplates naturally occurring antibodies specific for a polypeptide of 
the invention. For example, serum antibodies to a polypeptide of the invention in a human 
population can be purified by methods well known in the art, e.g., by passing antiserum over a
column to which the corresponding selected polypeptide or fusion protein is bound. The bound 
antibodies can then be eluted from the column, for example using a buffer with a high salt 
concentration. 

In addition to the antibodies discussed above, the invention also contemplates genetically 
engineered antibodies and antibody derivatives (e.g., single chain antibodies, antibody fragments (e.g.,
Fab, etc.)), according to methods well known in the art.

Polynucleotides or Arrays for Diagnostics

Polynucleotide arrays provide a high throughput technique that can assay a large number of 
polynucleotide sequences in a sample. This technology can be used as a diagnostic and as a tool to
test for differential expression, e.g., to determine function of an encoded protein. Arrays can be
created by spotting polynucleotide probes onto a substrate (e.g., glass, nitrocellulose, etc.) in a
two-dimensional matrix or array having bound probes. The probes can be bound to the substrate by
either covalent bonds or by non-specific interactions, such as hydrophobic interactions. Samples of 
polynucleotides can be detectably labeled (e.g., using radioactive or fluorescent labels) and then
hybridized to the probes. Double stranded polynucleotides, comprising the labeled sample
polynucleotides bound to probe polynucleotides, can be detected once the unbound portion of the
sample is washed away. Techniques for constructing arrays and methods of using these arrays are 
described in EP 799 897; WO 97/29212; WO 97/27317; EP 785 280; WO 97/02357; USPN 
5,593,839; USPN 5,578,832; EP 728 520; USPN 5,599,695; EP 721 016; USPN 5,556,752; WO
95/22058; and USPN 5,631,734. Arrays can be used to, for example, examine differential
expression of genes and can be used to determine gene function. For example, arrays can be used
to detect differential expression of a polynucleotide between a test cell and control cell (e.g., cancer 
cells and normal cells). For example, high expression of a particular message in a cancer cell, 
which is not observed in a corresponding normal cell, can indicate a cancer specific gene product.
Exemplary uses of arrays are further described in, for example, Pappalarado et al., Sem. Radiation
Oncol. (1998) 8:217; and Ramsay, Nature Biotechnol. (1998) 16:40.

Differential Expression in Diagnosis

The polynucleotides of the invention can also be used to detect differences in expression 
levels between two cells, e.g., as a method to identify abnormal or diseased tissue in a human. For 
polynucleotides corresponding to profiles of protein families, the choice of tissue can be selected
according to the putative biological function. In general, the expression of a gene corresponding to 




WO 01/72781 



PCT/US01/09952 



a specific polynucleotide is compared between a first tissue that is suspected of being diseased and 
a second, normal tissue of the human. The tissue suspected of being abnormal or diseased can be 
derived from a different tissue type of the human, but preferably it is derived from the same tissue 
type; for example an intestinal polyp or other abnormal growth should be compared with normal 
intestinal tissue. The normal tissue can be the same tissue as that of the test sample, or any normal
tissue of the patient, especially those that express the polynucleotide-related gene of interest (e.g., 
brain, thymus, testis, heart, prostate, placenta, spleen, small intestine, skeletal muscle, pancreas, and 
the mucosal lining of the colon). A difference between the polynucleotide-related gene, mRNA, or 
protein in the two tissues which are compared, for example in molecular weight, amino acid or 
nucleotide sequence, or relative abundance, indicates a change in the gene, or a gene which
regulates it, in the tissue of the human that was suspected of being diseased. Examples of detection
of differential expression and its use in diagnosis of cancer are described in USPNs 5,688,641 and 
5,677,125. 

A genetic predisposition to disease in a human can also be detected by comparing
expression levels of an mRNA or protein corresponding to a polynucleotide of the invention in a
fetal tissue with levels associated in normal fetal tissue. Fetal tissues that are used for this purpose 
include, but are not limited to, amniotic fluid, chorionic villi, blood, and the blastomere of an in 
vitro-fertilized embryo. The comparable normal polynucleotide-related gene is obtained from any 
tissue. The mRNA or protein is obtained from a normal tissue of a human in which the
polynucleotide-related gene is expressed. Differences such as alterations in the nucleotide
sequence or size of the same product of the fetal polynucleotide-related gene or mRNA, or 
alterations in the molecular weight, amino acid sequence, or relative abundance of fetal protein, can 
indicate a germline mutation in the polynucleotide-related gene of the fetus, which indicates a 
genetic predisposition to disease. In general, diagnostic, prognostic, and other methods of the
invention based on differential expression involve detection of a level or amount of a gene product,
particularly a differentially expressed gene product, in a test sample obtained from a patient 
suspected of having or being susceptible to a disease (e.g., breast cancer, lung cancer, colon cancer 
and/or metastatic forms thereof), and comparing the detected levels to those levels found in normal 
cells (e.g., cells substantially unaffected by cancer) and/or other control cells (e.g., to differentiate a
cancerous cell from a cell affected by dysplasia). Furthermore, the severity of the disease can be
assessed by comparing the detected levels of a differentially expressed gene product with those
levels detected in samples representing the levels of differentially expressed gene product associated with
varying degrees of severity of disease. It should be noted that use of the term "diagnostic" herein is
not necessarily meant to exclude "prognostic" or "prognosis," but rather is used as a matter of
convenience.

The term "differentially expressed gene" is generally intended to encompass a
polynucleotide that can, for example, include an open reading frame encoding a gene product (e.g., 
a polypeptide), and/or introns of such genes and adjacent 5' and 3' non-coding nucleotide sequences
involved in the regulation of expression, up to about 20 kb beyond the coding region, but possibly
further in either direction. The gene can be introduced into an appropriate vector for
extrachromosomal maintenance or for integration into a host genome. In general, a difference in
expression level associated with a decrease in expression level of at least about 25%, usually at 
least about 50% to 75%, more usually at least about 90% or more is indicative of a differentially 
expressed gene of interest, i.e., a gene that is underexpressed or down-regulated in the test sample
relative to a control sample. Furthermore, a difference in expression level associated with an
increase in expression of at least about 25%, usually at least about 50% to 75%, more usually at
least about 90% and can be at least about 1½-fold, usually at least about 2-fold to about 10-fold,
and can be about 100-fold to about 1,000-fold increase relative to a control sample is indicative of a 
differentially expressed gene of interest, i.e., an overexpressed or up-regulated gene. 
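
The thresholds described above can be applied as in the following sketch, which classifies a gene from its expression level in a test sample relative to a control. The function name, the labels, and the example levels are hypothetical; the default cutoff of 25% corresponds to the least stringent threshold given in the text, and a more stringent value (e.g., 0.5 or 0.9) can be supplied instead.

```python
def classify_expression(test_level, control_level, threshold=0.25):
    """Classify expression in a test sample relative to a control sample.

    `threshold` is the minimum fractional change (0.25 = 25%) treated as
    indicative of differential expression.
    """
    if control_level <= 0:
        raise ValueError("control level must be positive")
    change = (test_level - control_level) / control_level
    if change >= threshold:
        return "overexpressed"
    if change <= -threshold:
        return "underexpressed"
    return "unchanged"

# Hypothetical expression levels in arbitrary units.
calls = [
    classify_expression(200.0, 100.0),  # 100% increase (2-fold)
    classify_expression(70.0, 100.0),   # 30% decrease
    classify_expression(110.0, 100.0),  # 10% increase
]
```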

"Differentially expressed polynucleotide" as used herein means a nucleic acid molecule
(RNA or DNA) comprising a sequence that represents a differentially expressed gene, e.g., the
differentially expressed polynucleotide comprises a sequence (e.g., an open reading frame encoding 
a gene product) that uniquely identifies a differentially expressed gene so that detection of the 
differentially expressed polynucleotide in a sample is correlated with the presence of a
differentially expressed gene in a sample. "Differentially expressed polynucleotides" is also meant
to encompass fragments of the disclosed polynucleotides, e.g., fragments retaining biological 
activity, as well as nucleic acids homologous, substantially similar, or substantially identical (e.g., 
having about 90% sequence identity) to the disclosed polynucleotides. 
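
Percent sequence identity of the kind referred to above can be computed, in the simplest case, as the fraction of matching positions between two sequences that have already been aligned to equal length. The sketch below makes that assumption explicitly and does not perform alignment or handle gaps; the function name and sequences are hypothetical.

```python
def percent_identity(a, b):
    """Percent of identical positions between two pre-aligned sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)

# Placeholder sequences differing at one of ten positions.
identity = percent_identity("ACGTACGTAC", "ACGTACGTAA")
```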

"Diagnosis" as used herein generally includes determination of a subject's susceptibility to 
a disease or disorder, determination as to whether a subject is presently affected by a disease or
disorder, as well as to the prognosis of a subject affected by a disease or disorder (e.g., 
identification of pre-metastatic or metastatic cancerous states, stages of cancer, or responsiveness of 
cancer to therapy). The present invention particularly encompasses diagnosis of subjects in the 
context of breast cancer (e.g., carcinoma in situ (e.g., ductal carcinoma in situ), estrogen receptor
(ER)-positive breast cancer, ER-negative breast cancer, or other forms and/or stages of breast
cancer), lung cancer (e.g., small cell carcinoma, non-small cell carcinoma, mesothelioma, and other
forms and/or stages of lung cancer), and colon cancer (e.g., adenomatous polyp, colorectal 
carcinoma, and other forms and/or stages of colon cancer). 

"Sample" or "biological sample" as used throughout here are generally meant to refer to 
samples of biological fluids or tissues, particularly samples obtained from tissues, especially from
cells of the type associated with the disease for which the diagnostic application is designed (e.g.,
ductal adenocarcinoma), and the like. "Samples" is also meant to encompass derivatives and
fractions of such samples (e.g., cell lysates). Where the sample is solid tissue, the cells of the tissue 
can be dissociated or tissue sections can be analyzed. 

Methods of the subject invention useful in diagnosis or prognosis typically involve 
comparison of the abundance of a selected differentially expressed gene product in a sample of
interest with that of a control to determine any relative differences in the expression of the gene 
product, where the difference can be measured qualitatively and/or quantitatively. Quantitation can 
be accomplished, for example, by comparing the level of expression product detected in the sample 
with the amounts of product present in a standard curve. A comparison can be made visually; by
using a technique such as densitometry, with or without computerized assistance; by preparing a
representative library of cDNA clones of mRNA isolated from a test sample, sequencing the clones
in the library to determine the number of cDNA clones corresponding to the same gene product,
and analyzing the number of clones corresponding to that same gene product relative to the number
of clones of the same gene product in a control sample; or by using an array to detect relative levels
of hybridization to a selected sequence or set of sequences, and comparing the hybridization pattern
to that of a control. The differences in expression are then correlated with the presence or absence 
of an abnormal expression pattern. A variety of different methods for determining the nucleic acid 
abundance in a sample are known to those of skill in the art (see, e.g., WO 97/27317). 
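
Quantitation against a standard curve, as mentioned above, can be sketched as linear interpolation between the two standards whose signals bracket the sample signal. The function name and the standard values are hypothetical, and linear interpolation is an assumed simplification; other curve-fitting approaches may be more appropriate for a given assay.

```python
def amount_from_standard_curve(signal, standards):
    """Estimate an amount from a measured signal.

    `standards` is a list of (known_amount, measured_signal) pairs; the
    estimate is interpolated linearly between the bracketing standards.
    """
    points = sorted(standards, key=lambda point: point[1])
    for (a0, s0), (a1, s1) in zip(points, points[1:]):
        if s0 <= signal <= s1:
            if s1 == s0:
                return a0
            return a0 + (a1 - a0) * (signal - s0) / (s1 - s0)
    raise ValueError("signal is outside the range of the standard curve")

# Hypothetical standards: (amount, signal) in arbitrary units.
standards = [(0.0, 0.0), (10.0, 100.0), (20.0, 300.0)]
estimate = amount_from_standard_curve(200.0, standards)
```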

In general, diagnostic assays of the invention involve detection of a gene product of the
polynucleotide sequence (e.g., mRNA or polypeptide) that corresponds to a sequence of SEQ ID
NOS:1-316. The patient from whom the sample is obtained can be apparently healthy, susceptible to
disease (e.g., as determined by family history or exposure to certain environmental factors), or can 
already be identified as having a condition in which altered expression of a gene product of the 
invention is implicated. 

Diagnosis can be determined based on detected gene product expression levels of a gene
product encoded by at least one, preferably at least two or more, at least 3 or more, or at least 4 or
more of the polynucleotides having a sequence set forth in SEQ ID NOS:1-316, and can involve
detection of expression of genes corresponding to all of SEQ ID NOS:1-316 and/or additional
sequences that can serve as additional diagnostic markers and/or reference sequences. Where the
diagnostic method is designed to detect the presence or susceptibility of a patient to cancer, the
assay preferably involves detection of a gene product encoded by a gene corresponding to a 
polynucleotide that is differentially expressed in cancer. Examples of such differentially expressed 
polynucleotides are described in the Examples below. Given the provided polynucleotides and 
information regarding their relative expression levels provided herein, assays using such
polynucleotides and detection of their expression levels in diagnosis and prognosis will be readily
apparent to the ordinarily skilled artisan.

Any of a variety of detectable labels can be used in connection with the various
embodiments of the diagnostic methods of the invention. Suitable detectable labels include 
fluorochromes (e.g., fluorescein isothiocyanate (FITC), rhodamine, Texas Red, phycoerythrin,
allophycocyanin, 6-carboxyfluorescein (6-FAM), 2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein,
6-carboxy-X-rhodamine (ROX), 6-carboxy-2',4',7',4,7-hexachlorofluorescein
(HEX), 5-carboxyfluorescein (5-FAM) or N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA)),
radioactive labels (e.g., 32P, 35S, 3H, etc.), and the like. The detectable label can involve a two-stage
system (e.g., biotin-avidin, hapten-anti-hapten antibody, etc.).

Reagents specific for the polynucleotides and polypeptides of the invention, such as
antibodies and nucleotide probes, can be supplied in a kit for detecting the presence of an
expression product in a biological sample. The kit can also contain buffers or labeling components,
as well as instructions for using the reagents to detect and quantify expression products in the 
biological sample. Exemplary embodiments of the diagnostic methods of the invention are 
described below in more detail. 

Polypeptide detection in diagnosis. In one embodiment, the test sample is assayed for the
level of a differentially expressed polypeptide. Diagnosis can be accomplished using any of a
number of methods to determine the absence or presence or altered amounts of the differentially
expressed polypeptide in the test sample. For example, detection can utilize staining of cells or
histological sections with labeled antibodies, performed in accordance with conventional methods.
Cells can be permeabilized to stain cytoplasmic molecules. In general, antibodies that specifically
bind a differentially expressed polypeptide of the invention are added to a sample, and incubated
for a period of time sufficient to allow binding to the epitope, usually at least about 10 minutes.
The antibody can be detectably labeled for direct detection (e.g., using radioisotopes, enzymes,
fluorescers, chemiluminescers, and the like), or can be used in conjunction with a second stage
antibody or reagent to detect binding (e.g., biotin with horseradish peroxidase-conjugated avidin, a
secondary antibody conjugated to a fluorescent compound, e.g., fluorescein, rhodamine, Texas red,
etc.). The absence or presence of antibody binding can be determined by various methods,
including flow cytometry of dissociated cells, microscopy, radiography, scintillation counting, etc.
Any suitable alternative method of qualitative or quantitative detection of levels or amounts of
differentially expressed polypeptide can be used, for example ELISA, western blot,
immunoprecipitation, radioimmunoassay, etc.

mRNA detection. The diagnostic methods of the invention can also or alternatively involve
detection of mRNA encoded by a gene corresponding to a differentially expressed polynucleotide
of the invention. Any suitable qualitative or quantitative methods known in the art for detecting
specific mRNAs can be used. mRNA can be detected by, for example, in situ hybridization in
tissue sections, by reverse transcriptase-PCR, or in Northern blots containing poly A+ mRNA. One
of skill in the art can readily use these methods to determine differences in the size or amount of
mRNA transcripts between two samples. mRNA expression levels in a sample can also be
determined by generation of a library of expressed sequence tags (ESTs) from the sample, where
the EST library is representative of sequences present in the sample (Adams, et al., (1991) Science
252:1651). Enumeration of the relative representation of ESTs within the library can be used to
approximate the relative representation of the gene transcript within the starting sample. The
results of EST analysis of a test sample can then be compared to EST analysis of a reference sample
to determine the relative expression levels of a selected polynucleotide, particularly a
polynucleotide corresponding to one or more of the differentially expressed genes described herein.
Alternatively, analysis of gene expression in a test sample can be performed using serial analysis of
gene expression (SAGE) methodology (e.g., Velculescu et al., Science (1995) 270:484) or differential
display (DD) methodology (see, e.g., U.S. 5,776,683; and U.S. 5,807,680).
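
The EST enumeration described above can be illustrated with a minimal sketch. It assumes that each EST has already been assigned to a gene; the gene names, the library contents, and the function names are hypothetical and are not part of the disclosed methods.

```python
from collections import Counter


def relative_representation(est_gene_ids):
    """Return each gene's share of the ESTs in one library."""
    counts = Counter(est_gene_ids)
    total = sum(counts.values())
    return {gene: n / total for gene, n in counts.items()}


def expression_ratio(test_ests, reference_ests, gene):
    """Ratio of a gene's relative representation, test library over reference library."""
    test = relative_representation(test_ests).get(gene, 0.0)
    ref = relative_representation(reference_ests).get(gene, 0.0)
    if ref == 0.0:
        return None  # gene not observed in the reference library
    return test / ref


# Hypothetical libraries: each entry names the gene an EST was assigned to.
test_library = ["geneA"] * 6 + ["geneB"] * 2 + ["geneC"] * 2
reference_library = ["geneA"] * 2 + ["geneB"] * 4 + ["geneC"] * 4

# geneA makes up 6 of 10 test ESTs and 2 of 10 reference ESTs,
# so its ratio is approximately 3.
ratio_gene_a = expression_ratio(test_library, reference_library, "geneA")
```

A ratio well above or below 1 suggests differential representation of the transcript. In practice, library size and sampling error would also need to be taken into account.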

Alternatively, gene expression can be analyzed using hybridization analysis.
Oligonucleotides or cDNA can be used to selectively identify or capture DNA or RNA of specific
sequence composition, and the amount of RNA or cDNA hybridized to a known capture sequence
determined qualitatively or quantitatively, to provide information about the relative representation
of a particular message within the pool of cellular messages in a sample. Hybridization analysis
can be designed to allow for concurrent screening of the relative expression of hundreds to
thousands of genes by using, for example, array-based technologies having high density formats,
including filters, microscope slides, or microchips, or solution-based technologies that use
spectroscopic analysis (e.g., mass spectrometry). One exemplary use of arrays in the diagnostic
methods of the invention is described below in more detail.

Use of a single gene in diagnostic applications. The diagnostic methods of the invention 
can focus on the expression of a single differentially expressed gene. For example, the diagnostic
method can involve detecting a differentially expressed gene, or a polymorphism of such a gene
(e.g., a polymorphism in a coding region or control region), that is associated with disease.
Disease-associated polymorphisms can include deletion or truncation of the gene, mutations that 
alter expression level and/or affect activity of the encoded protein, etc. 

A number of methods are available for analyzing nucleic acids for the presence of a
specific sequence, e.g., a disease-associated polymorphism. Where large amounts of DNA are
available, genomic DNA is used directly. Alternatively, the region of interest is cloned into a
suitable vector and grown in sufficient quantity for analysis. Cells that express a differentially
expressed gene can be used as a source of mRNA, which can be assayed directly or reverse
transcribed into cDNA for analysis. The nucleic acid can be amplified by conventional techniques,
such as the polymerase chain reaction (PCR), to provide sufficient amounts for analysis, and a
detectable label can be included in the amplification reaction (e.g., using a detectably labeled
primer or detectably labeled oligonucleotides) to facilitate detection. Alternatively, various methods
are also known in the art that utilize oligonucleotide ligation as a means of detecting
polymorphisms, see, e.g., Riley et al., Nucl. Acids Res. (1990) 18:2887; and Delahunty et al., Am. J.
Hum. Genet. (1996) 58:1239.

The amplified or cloned sample nucleic acid can be analyzed by one of a number of
methods known in the art. The nucleic acid can be sequenced by dideoxy or other methods, and the
sequence of bases compared to a selected sequence, e.g., to a wild-type sequence. Hybridization
with the polymorphic or variant sequence can also be used to determine its presence in a sample
(e.g., by Southern blot, dot blot, etc.). The hybridization pattern of a polymorphic or variant
sequence and a control sequence to an array of oligonucleotide probes immobilized on a solid
support, as described in US 5,445,934, or in WO 95/35505, can also be used as a means of
identifying polymorphic or variant sequences associated with disease. Single strand conformational
polymorphism (SSCP) analysis, denaturing gradient gel electrophoresis (DGGE), and heteroduplex
analysis in gel matrices are used to detect conformational changes created by DNA sequence
variation as alterations in electrophoretic mobility. Alternatively, where a polymorphism creates or
destroys a recognition site for a restriction endonuclease, the sample is digested with that
endonuclease, and the products size fractionated to determine whether the fragment was digested.
Fractionation is performed by gel or capillary electrophoresis, particularly acrylamide or agarose
gels.
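
The restriction-site approach can be illustrated with a short sketch that predicts the fragment sizes expected after complete digestion of a linear amplicon. The two sequences are hypothetical, the function name is an assumption made for illustration, and EcoRI (recognition site GAATTC, cut after the first G) is used only as a familiar example enzyme.

```python
def digest_fragments(sequence, site, cut_offset):
    """Predict fragment lengths after complete digestion of a linear DNA.

    `site` is the recognition sequence and `cut_offset` is the number of
    bases from the start of the site to the cut position on this strand.
    """
    fragments = []
    start = 0
    pos = sequence.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(cut - start)
        start = cut
        pos = sequence.find(site, pos + 1)
    fragments.append(len(sequence) - start)
    return fragments


ECORI_SITE = "GAATTC"
ECORI_CUT_OFFSET = 1  # EcoRI cuts between G and AATTC

# Hypothetical amplicons: the variant carries an A-to-G change
# that destroys the recognition site.
wild_type = "ATGCGAATTCGGTA"
variant = "ATGCGAGTTCGGTA"

wild_type_fragments = digest_fragments(wild_type, ECORI_SITE, ECORI_CUT_OFFSET)
variant_fragments = digest_fragments(variant, ECORI_SITE, ECORI_CUT_OFFSET)
```

Comparing the predicted fragment sizes for the two alleles indicates which banding pattern to expect after size fractionation: two fragments where the site is intact, and a single undigested fragment where it has been destroyed.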

Screening for mutations in a gene can be based on the functional or antigenic
characteristics of the protein. Protein truncation assays are useful in detecting deletions that can
affect the biological activity of the protein. Various immunoassays designed to detect
polymorphisms in proteins can be used in screening. Where many diverse genetic mutations lead to
a particular disease phenotype, functional protein assays have proven to be effective screening
tools. The activity of the encoded protein can be determined by comparison with the wild-type
protein.

Pattern matching in diagnosis using arrays. In another embodiment, the diagnostic and/or 
prognostic methods of the invention involve detection of expression of a selected set of genes in a 
test sample to produce a test expression pattern (TEP). The TEP is compared to a reference
expression pattern (REP), which is generated by detection of expression of the selected set of genes
in a reference sample (e.g., a positive or negative control sample). The selected set of genes
includes at least one of the genes of the invention, which genes correspond to the polynucleotide
sequences of SEQ ID NOS:1-316. Of particular interest is a selected set of genes that includes genes
differentially expressed in the disease for which the test sample is to be screened.

"Reference sequences" or "reference polynucleotides" as used herein in the context of
differential gene expression analysis and diagnosis/prognosis refer to a selected set of
polynucleotides, which selected set includes at least one of the differentially expressed
polynucleotides described herein. A plurality of reference sequences, preferably comprising
positive and negative control sequences, can be included as reference sequences. Additional
suitable reference sequences are found in GenBank, Unigene, and other nucleotide sequence
databases (including, e.g., expressed sequence tag (EST), partial, and full-length sequences).

"Reference array" means an array having reference sequences for use in hybridization with 
a sample, where the reference sequences include all, at least one of, or any subset of the 
differentially expressed polynucleotides described herein. Usually such an array will include at
least 3 different reference sequences, and can include any one or all of the provided differentially
expressed sequences. Arrays of interest can further comprise sequences, including polymorphisms,
of other genetic sequences, particularly other sequences of interest for screening for a disease or
disorder (e.g., cancer, dysplasia, or other related or unrelated diseases, disorders, or conditions).
The oligonucleotide sequence on the array will usually be at least about 12 nt in length, and can be
of about the length of the provided sequences, or can extend into the flanking regions to generate
fragments of 100 nt to 200 nt in length or more. Reference arrays can be produced according to any
suitable methods known in the art. For example, methods of producing large arrays of
oligonucleotides are described in U.S. 5,134,854, and U.S. 5,445,934 using light-directed synthesis
techniques. Using a computer controlled system, a heterogeneous array of monomers is converted,
through simultaneous coupling at a number of reaction sites, into a heterogeneous array of
polymers. Alternatively, microarrays are generated by deposition of pre-synthesized
oligonucleotides onto a solid substrate, for example as described in PCT published application no.
WO 95/35505.

A "reference expression pattern" or "REP" as used herein refers to the relative levels of 
expression of a selected set of genes, particularly of differentially expressed genes, that is
associated with a selected cell type, e.g., a normal cell, a cancerous cell, a cell exposed to an
environmental stimulus, and the like. A "test expression pattern" or "TEP" refers to relative levels
of expression of a selected set of genes, particularly of differentially expressed genes, in a test 
sample (e.g., a cell of unknown or suspected disease state, from which mRNA is isolated). 

REPs can be generated in a variety of ways according to methods well known in the art.
For example, REPs can be generated by hybridizing a control sample to an array having a selected
set of polynucleotides (particularly a selected set of differentially expressed polynucleotides),
acquiring the hybridization data from the array, and storing the data in a format that allows for
ready comparison of the REP with a TEP. Alternatively, all expressed sequences in a control
sample can be isolated and sequenced, e.g., by isolating mRNA from a control sample, converting
the mRNA into cDNA, and sequencing the cDNA. The resulting sequence information roughly or
precisely reflects the identity and relative number of expressed sequences in the sample. The
sequence information can then be stored in a format (e.g., a computer-readable format) that allows
for ready comparison of the REP with a TEP. The REP can be normalized prior to or after data
storage, and/or can be processed to selectively remove sequences of expressed genes that are of less
interest or that might complicate analysis (e.g., some or all of the sequences associated with
housekeeping genes can be eliminated from REP data).
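
The processing of a stored REP can be sketched as follows. The sketch assumes that the raw pattern is held as a mapping from gene name to signal and that normalization means scaling the retained signals so that they sum to 1. The text does not prescribe a normalization scheme, so this choice, the function name, and the example values are all assumptions made for illustration.

```python
def build_rep(raw_signals, housekeeping=frozenset()):
    """Drop housekeeping genes, then scale the remaining signals to sum to 1."""
    kept = {gene: s for gene, s in raw_signals.items() if gene not in housekeeping}
    total = sum(kept.values())
    if total == 0:
        raise ValueError("no signal remains after filtering")
    return {gene: s / total for gene, s in kept.items()}


# Hypothetical raw signals; ACTB (beta-actin) stands in for a housekeeping gene.
raw = {"geneA": 30.0, "geneB": 10.0, "ACTB": 60.0}
rep = build_rep(raw, housekeeping={"ACTB"})
# rep == {"geneA": 0.75, "geneB": 0.25}
```

Filtering before normalization keeps an abundant housekeeping signal from compressing the relative levels of the genes of interest.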

TEPs can be generated in a manner similar to REPs, e.g., by hybridizing a test sample to an 
array having a selected set of polynucleotides, particularly a selected set of differentially expressed 
polynucleotides, acquiring the hybridization data from the array, and storing the data in a format 
that allows for ready comparison of the TEP with a REP. The REP and TEP to be used in a 
comparison can be generated simultaneously, or the TEP can be compared to previously generated 
and stored REPs. 

In one embodiment of the invention, comparison of a TEP with a REP involves hybridizing 
a test sample with a reference array, where the reference array has one or more reference sequences 
for use in hybridization with a sample. The reference sequences include all, at least one of, or any 
subset of the differentially expressed polynucleotides described herein. Hybridization data for the 
test sample is acquired, the data normalized, and the produced TEP compared with a REP generated 
using an array having the same or similar selected set of differentially expressed polynucleotides. 
Probes that correspond to sequences differentially expressed between the two samples will show 
decreased or increased hybridization efficiency for one of the samples relative to the other. 

Methods for collection of data from hybridization of samples with reference arrays are
well known in the art. For example, the polynucleotides of the reference and test samples can be
generated using a detectable fluorescent label, and hybridization of the polynucleotides in the
samples detected by scanning the microarrays for the presence of the detectable label using, for
example, a microscope and light source for directing light at a substrate. A photon counter detects
fluorescence from the substrate, while an x-y translation stage varies the location of the substrate.
A confocal detection device that can be used in the subject methods is described in USPN
5,631,734. A scanning laser microscope is described in Shalon et al., Genome Res. (1996) 6:639.
A scan, using the appropriate excitation line, is performed for each fluorophore used. The digital
images generated from the scan are then combined for subsequent analysis. For any particular
array element, the fluorescent signal from one sample (e.g., a test sample) is compared
to the fluorescent signal from another sample (e.g., a reference sample), and the relative signal
intensity determined.
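
The per-element comparison of two fluorescent signals can be sketched as below. Expressing the relative intensity as a log2 ratio, and subtracting a single background value, are conventions assumed here for illustration; the text only requires that a relative signal intensity be determined. Element names and intensities are hypothetical.

```python
import math


def log2_ratios(test_signal, reference_signal, background=0.0):
    """Log2 ratio of test to reference signal for each shared array element."""
    ratios = {}
    for element, test in test_signal.items():
        ref = reference_signal.get(element)
        if ref is None:
            continue  # element was not measured in the reference scan
        test_net = test - background
        ref_net = ref - background
        if test_net <= 0 or ref_net <= 0:
            continue  # signal at or below background; ratio is undefined
        ratios[element] = math.log2(test_net / ref_net)
    return ratios


# Hypothetical intensities from the two scans of one array.
test_scan = {"spot1": 800.0, "spot2": 100.0, "spot3": 50.0}
reference_scan = {"spot1": 200.0, "spot2": 100.0, "spot3": 200.0}

ratios = log2_ratios(test_scan, reference_scan)
# spot1 is 4-fold higher in the test sample, spot2 is unchanged,
# and spot3 is 4-fold lower.
```

On a log2 scale, increases and decreases of equal magnitude are symmetric about zero, which simplifies later comparison across elements.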

Methods for analyzing the data collected from hybridization to arrays are well known in the
art. For example, where detection of hybridization involves a fluorescent label, data analysis can
include the steps of determining fluorescent intensity as a function of substrate position from the
data collected, removing outliers, i.e., data deviating from a predetermined statistical distribution,
and calculating the relative binding affinity of the targets from the remaining data. The resulting
data can be displayed as an image with the intensity in each region varying according to the binding
affinity between targets and probes.
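
The outlier-removal and averaging steps can be sketched as follows. The text does not specify the statistical distribution or the outlier test, so this sketch substitutes a simple rule based on the median absolute deviation (MAD), followed by scaling to the strongest position. The rule, the threshold of 3, and all names and values are assumptions made for illustration.

```python
from statistics import mean, median


def remove_outliers(values, max_deviations=3.0):
    """Keep values within `max_deviations` MADs of the median."""
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        return list(values)  # no spread to judge against; keep everything
    return [v for v in values if abs(v - med) / mad <= max_deviations]


def relative_binding(intensity_by_position, max_deviations=3.0):
    """Mean cleaned intensity per position, scaled to the strongest position."""
    cleaned = {
        pos: mean(remove_outliers(vals, max_deviations))
        for pos, vals in intensity_by_position.items()
    }
    top = max(cleaned.values())
    if top == 0:
        raise ValueError("all positions have zero signal")
    return {pos: val / top for pos, val in cleaned.items()}


# Hypothetical replicate intensities at two substrate positions.
replicates = {
    "spot_a": [100.0, 102.0, 98.0, 101.0, 500.0],  # 500.0 is an artifact
    "spot_b": [50.0, 51.0, 49.0, 50.0, 50.0],
}
relative = relative_binding(replicates)
```

The resulting values between 0 and 1 could then be mapped to pixel intensities to display the image described above.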

In general, the test sample is classified as having a gene expression profile corresponding to
that associated with a disease or non-disease state by comparing the TEP generated from the test
sample to one or more REPs generated from reference samples (e.g., from samples associated with
cancer or specific stages of cancer, dysplasia, samples affected by a disease other than cancer,
normal samples, etc.). The criteria for a match or a substantial match between a TEP and a REP
include expression of the same or substantially the same set of reference genes, as well as
expression of these reference genes at substantially the same levels (e.g., no significant difference
between the samples for a signal associated with a selected reference sequence after normalization
of the samples, or at least no greater than about 25% to about 40% difference in signal strength for
a given reference sequence). In general, a pattern match between a TEP and a REP includes a match
in expression, preferably a match in qualitative or quantitative expression level, of at least one of,
all or any subset of the differentially expressed genes of the invention.
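
The match criteria can be expressed as a small sketch. It assumes that both patterns are already normalized and stored as mappings from gene name to expression level, and it applies the stricter reading in which the two patterns must cover exactly the same genes. The function name, the default threshold of 25%, and the example values are assumptions made for illustration.

```python
def patterns_match(tep, rep, max_relative_difference=0.25):
    """Return True if every gene's level in the TEP is within the allowed
    relative difference of its level in the REP."""
    if set(tep) != set(rep):
        return False  # the patterns do not cover the same set of genes
    for gene, ref_level in rep.items():
        test_level = tep[gene]
        if ref_level == 0:
            if test_level != 0:
                return False
            continue
        if abs(test_level - ref_level) / ref_level > max_relative_difference:
            return False
    return True


# Hypothetical normalized expression levels.
rep_example = {"geneA": 1.0, "geneB": 2.0}
tep_close = {"geneA": 1.2, "geneB": 1.8}  # differs by 20% and 10%
tep_mid = {"geneA": 1.3, "geneB": 2.0}    # differs by 30% and 0%

close_is_match = patterns_match(tep_close, rep_example)
mid_is_match_strict = patterns_match(tep_mid, rep_example)
mid_is_match_loose = patterns_match(tep_mid, rep_example, max_relative_difference=0.40)
```

With these values, the first pattern matches at the 25% threshold, while the second matches only when the threshold is relaxed to 40%. A test sample would be classified by running the same comparison against each stored REP.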

Pattern matching can be performed manually, or can be performed using a computer 
program. Methods for preparation of substrate matrices (e.g., arrays), design of oligonucleotides 
for use with such matrices, labeling of probes, hybridization conditions, scanning of hybridized 
matrices, and analysis of patterns generated, including comparison analysis, are described in, for
example, U.S. 5,800,992.

Diagnosis, Prognosis and Management of Cancer 

The polynucleotides of the invention and their gene products are of particular interest as
genetic or biochemical markers (e.g., in blood or tissues) that can detect the earliest changes along
the carcinogenesis pathway and/or monitor the efficacy of various therapies and preventive
interventions. For example, the level of expression of certain polynucleotides can be indicative of a
poorer prognosis, and therefore warrant more aggressive chemo- or radio-therapy for a patient, or
vice versa. The correlation of novel surrogate tumor specific features with response to treatment
and outcome in patients can define prognostic indicators that allow the design of tailored therapy
based on the molecular profile of the tumor. These therapies include antibody targeting and gene
therapy. Determining expression of certain polynucleotides and comparison of a patient's profile
with known expression in normal tissue and variants of the disease allows a determination of the
best possible treatment for a patient, both in terms of specificity of treatment and in terms of
comfort level of the patient. Surrogate tumor markers, such as polynucleotide expression, can also
be used to better classify, and thus diagnose and treat, different forms and disease states of cancer.
Two classifications widely used in oncology that can benefit from identification of the expression
levels of the polynucleotides of the invention are staging of the cancerous disorder, and grading the
nature of the cancerous tissue.

The polynucleotides of the invention can be useful to monitor patients having or 
susceptible to cancer to detect potentially malignant events at a molecular level before they are
detectable at a gross morphological level. Furthermore, a polynucleotide of the invention identified
as important for one type of cancer can also have implications for development or risk of
development of other types of cancer, e.g., where a polynucleotide is differentially expressed across
various cancer types. Thus, for example, expression of a polynucleotide that has clinical
implications for metastatic colon cancer can also have clinical implications for stomach cancer or
endometrial cancer.

Staging. Staging is a process used by physicians to describe how advanced the cancerous 
state is in a patient. Staging assists the physician in determining a prognosis, planning treatment 
and evaluating the results of such treatment. Staging systems vary with the types of cancer, but 
generally involve the following "TNM" system: the type of tumor, indicated by T; whether the
cancer has metastasized to nearby lymph nodes, indicated by N; and whether the cancer has
metastasized to more distant parts of the body, indicated by M. Generally, if a cancer is only
detectable in the area of the primary lesion without having spread to any lymph nodes it is called
Stage I. If it has spread only to the closest lymph nodes, it is called Stage II. In Stage III, the
cancer has generally spread to the lymph nodes in near proximity to the site of the primary lesion.
Cancers that have spread to a distant part of the body, such as the liver, bone, brain or other site, are
Stage IV, the most advanced stage.

The polynucleotides of the invention can facilitate fine-tuning of the staging process by 
identifying markers for the aggressiveness of a cancer, e.g., the metastatic potential, as well as the
presence in different areas of the body. Thus, detection of a polynucleotide signifying a
high metastatic potential can be used to change a borderline Stage II tumor to a Stage III
tumor, justifying more aggressive therapy. Conversely, the presence of a polynucleotide signifying
a lower metastatic potential allows more conservative staging of a tumor.

Grading of cancers. Grade is a term used to describe how closely a tumor resembles 
normal tissue of its same type. The microscopic appearance of a tumor is used to identify tumor
grade based on parameters such as cell morphology, cellular organization, and other markers of
differentiation. As a general rule, the grade of a tumor corresponds to its rate of growth or
aggressiveness, with undifferentiated or high-grade tumors being more aggressive than well
differentiated or low-grade tumors. The following guidelines are generally used for grading tumors:
1) GX Grade cannot be assessed; 2) G1 Well differentiated; 3) G2 Moderately well differentiated;
4) G3 Poorly differentiated; 5) G4 Undifferentiated. The polynucleotides of the invention can be
especially valuable in determining the grade of the tumor, as they not only can aid in determining
the differentiation status of the cells of a tumor, they can also identify factors other than
differentiation that are valuable in determining the aggressiveness of a tumor, such as metastatic
potential.

Detection of lung cancer. The polynucleotides of the invention can be used to detect lung 
cancer in a subject. Although there are more than a dozen different kinds of lung cancer, the two
main types of lung cancer are small cell and nonsmall cell, which encompass about 90% of all lung
cancer cases. Small cell carcinoma (also called oat cell carcinoma) usually starts in one of the
larger bronchial tubes, grows fairly rapidly, and is likely to be large by the time of diagnosis.
Nonsmall cell lung cancer (NSCLC) is made up of three general subtypes of lung cancer.
Epidermoid carcinoma (also called squamous cell carcinoma) usually starts in one of the larger
bronchial tubes and grows relatively slowly. The size of these tumors can range from very small to
quite large. Adenocarcinoma starts growing near the outside surface of the lung and can vary in
both size and growth rate. Some slowly growing adenocarcinomas are described as alveolar cell
cancer. Large cell carcinoma starts near the surface of the lung, grows rapidly, and the growth is
usually fairly large when diagnosed. Other less common forms of lung cancer are carcinoid,
cylindroma, mucoepidermoid, and malignant mesothelioma.

The polynucleotides of the invention, e.g., polynucleotides differentially expressed in 
normal cells versus cancerous lung cells (e.g., tumor cells of high or low metastatic potential) or 
between types of cancerous lung cells (e.g., high metastatic versus low metastatic), can be used to
distinguish types of lung cancer as well as to identify traits specific to a certain patient's cancer and
to select an appropriate therapy. For example, if the patient's biopsy expresses a polynucleotide
that is associated with a low metastatic potential, it may justify leaving a larger portion of the
patient's lung in surgery to remove the lesion. Alternatively, a smaller lesion with expression of a
polynucleotide that is associated with high metastatic potential may justify a more radical removal
of lung tissue and/or the surrounding lymph nodes, even if no metastasis can be identified through
pathological examination.

Detection of breast cancer. The majority of breast cancers are adenocarcinoma subtypes,
which can be summarized as follows: 1) ductal carcinoma in situ (DCIS), including
comedocarcinoma; 2) infiltrating (or invasive) ductal carcinoma (IDC); 3) lobular carcinoma in situ
(LCIS); 4) infiltrating (or invasive) lobular carcinoma (ILC); 5) inflammatory breast cancer; 6)
medullary carcinoma; 7) mucinous carcinoma; 8) Paget's disease of the nipple; 9) Phyllodes tumor;
and 10) tubular carcinoma.

The expression of polynucleotides of the invention can be used in the diagnosis and 
management of breast cancer, as well as to distinguish between types of breast cancer. Detection of
breast cancer can be determined using expression levels of any of the appropriate polynucleotides
of the invention, either alone or in combination. The aggressive nature and/or the
metastatic potential of a breast cancer can also be determined by comparing levels of one or more
polynucleotides of the invention with levels of another sequence known to vary in
cancerous tissue, e.g., ER expression. In addition, development of breast cancer can be detected by
examining the ratio of expression of a differentially expressed polynucleotide to the levels of
steroid hormones (e.g., testosterone or estrogen) or to other hormones (e.g., growth hormone,
insulin). Thus expression of specific marker polynucleotides can be used to discriminate between
normal and cancerous breast tissue, to discriminate between breast cancers with different cells of
origin, to discriminate between breast cancers with different potential metastatic rates, etc.

Detection of colon cancer. The polynucleotides of the invention exhibiting the appropriate 

expression pattern can be used to detect colon cancer in a subject. Colorectal cancer is one of the
most common neoplasms in humans and perhaps the most frequent form of hereditary neoplasia.
Prevention and early detection are key factors in controlling and curing colorectal cancer.
Colorectal cancer begins as polyps, which are small, benign growths of cells that form on the inner
lining of the colon. Over a period of several years, some of these polyps accumulate additional
mutations and become cancerous. Multiple familial colorectal cancer disorders have been
identified, which are summarized as follows: 1) Familial adenomatous polyposis (FAP); 2)
Gardner's syndrome; 3) Hereditary nonpolyposis colon cancer (HNPCC); and 4) Familial colorectal
cancer in Ashkenazi Jews. The expression of appropriate polynucleotides of the invention can be
used in the diagnosis, prognosis and management of colorectal cancer. Detection of colon cancer
can be determined using expression levels of any of these sequences, alone or in combination.
The aggressive nature and/or the metastatic potential of a
colon cancer can be determined by comparing levels of one or more polynucleotides of the
invention with total levels of another sequence known to vary in cancerous tissue, e.g.,
expression of p53, DCC, ras, or FAP (see, e.g., Fearon ER, et al., Cell (1990) 61(5):759; Hamilton
SR et al., Cancer (1993) 72:957; Bodmer W, et al., Nat Genet. (1994) 4(3):217; Fearon ER, Ann N
Y Acad Sci. (1995) 768:101). For example, development of colon cancer can be detected by
examining the ratio of any of the polynucleotides of the invention to the levels of oncogenes (e.g.,
ras) or tumor suppressor genes (e.g., FAP or p53). Thus expression of specific marker
polynucleotides can be used to discriminate between normal and cancerous colon tissue, to
discriminate between colon cancers with different cells of origin, to discriminate between colon
cancers with different potential metastatic rates, etc.

Detection of prostate cancer. The polynucleotides and their corresponding genes and gene 
products exhibiting the appropriate differential expression pattern can be used to detect prostate 
cancer in a subject. Over 95% of primary prostate cancers are adenocarcinomas. Signs and
symptoms may include: frequent urination, especially at night, inability to urinate, trouble starting
or holding back urination, a weak or interrupted urine flow and frequent pain or stiffness in the
lower back, hips or upper thighs.

Many of the signs and symptoms of prostate cancer can be caused by a variety of other
non-cancerous conditions. For example, one common cause of many of these signs and symptoms
is a condition called benign prostatic hypertrophy, or BPH. In BPH, the prostate gets bigger and
may block the flow of urine or interfere with sexual function. The methods and compositions of the
invention can be used to distinguish between prostate cancer and such non-cancerous conditions.
The methods of the invention can be used in conjunction with conventional methods of diagnosis,
e.g., digital rectal exam and/or detection of the level of prostate specific antigen (PSA), a substance
produced and secreted by the prostate.

Use of Polynucleotides to Screen for Peptide Analogs and Antagonists

Polypeptides encoded by the instant polynucleotides and corresponding full length genes
can be used to screen peptide libraries to identify binding partners, such as receptors, from among
the encoded polypeptides. Peptide libraries can be synthesized according to methods known in the
art (see, e.g., USPN 5,010,175, and WO 91/17823). Agonists or antagonists of the polypeptides of
the invention can be screened using any available method known in the art, such as signal
transduction, antibody binding, receptor binding, mitogenic assays, chemotaxis assays, etc. The
assay conditions ideally should resemble the conditions under which the native activity is exhibited
in vivo, that is, under physiologic pH, temperature, and ionic strength. Suitable agonists or
antagonists will exhibit strong inhibition or enhancement of the native activity at concentrations
that do not cause toxic side effects in the subject. Agonists or antagonists that compete for binding
to the native polypeptide can require concentrations equal to or greater than the native
concentration, while inhibitors capable of binding irreversibly to the polypeptide can be added in
concentrations on the order of the native concentration.

Such screening and experimentation can lead to identification of a novel polypeptide
binding partner, such as a receptor, encoded by a gene or a cDNA corresponding to a
polynucleotide of the invention, and at least one peptide agonist or antagonist of the novel binding
partner. Such agonists and antagonists can be used to modulate, enhance, or inhibit receptor
function in cells to which the receptor is native, or in cells that possess the receptor as a result of
genetic engineering. Further, if the novel receptor shares biologically important characteristics with
a known receptor, information about agonist/antagonist binding can facilitate development of
improved agonists/antagonists of the known receptor.

Pharmaceutical Compositions and Therapeutic Uses 

Pharmaceutical compositions of the invention can comprise polypeptides, antibodies, or
polynucleotides (including antisense nucleotides and ribozymes) of the claimed invention in a
therapeutically effective amount. The term "therapeutically effective amount" as used herein refers
to an amount of a therapeutic agent to treat, ameliorate, or prevent a desired disease or condition, or
to exhibit a detectable therapeutic or preventative effect. The effect can be detected by, for
example, chemical markers or antigen levels. Therapeutic effects also include reduction in physical
symptoms, such as decreased body temperature. The precise effective amount for a subject will
depend upon the subject's size and health, the nature and extent of the condition, and the
therapeutics or combination of therapeutics selected for administration. Thus, it is not useful to
specify an exact effective amount in advance. However, the effective amount for a given situation
is determined by routine experimentation and is within the judgment of the clinician. For purposes
of the present invention, an effective dose will generally be from about 0.01 mg/kg to 50 mg/kg or
0.05 mg/kg to about 10 mg/kg of the DNA constructs in the individual to which it is administered.

A pharmaceutical composition can also contain a pharmaceutically acceptable carrier. The
term "pharmaceutically acceptable carrier" refers to a carrier for administration of a therapeutic
agent, such as antibodies or a polypeptide, genes, and other therapeutic agents. The term refers to
any pharmaceutical carrier that does not itself induce the production of antibodies harmful to the
individual receiving the composition, and which can be administered without undue toxicity.
Suitable carriers can be large, slowly metabolized macromolecules such as proteins,
polysaccharides, polylactic acids, polyglycolic acids, polymeric amino acids, amino acid
copolymers, and inactive virus particles. Such carriers are well known to those of ordinary skill in
the art. Pharmaceutically acceptable carriers in therapeutic compositions can include liquids such
as water, saline, glycerol and ethanol. Auxiliary substances, such as wetting or emulsifying agents,
pH buffering substances, and the like, can also be present in such vehicles. Typically, the
therapeutic compositions are prepared as injectables, either as liquid solutions or suspensions; solid
forms suitable for solution in, or suspension in, liquid vehicles prior to injection can also be
prepared. Liposomes are included within the definition of a pharmaceutically acceptable carrier.
Pharmaceutically acceptable salts can also be present in the pharmaceutical composition, e.g.,
mineral acid salts such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and
the salts of organic acids such as acetates, propionates, malonates, benzoates, and the like. A
thorough discussion of pharmaceutically acceptable excipients is available in Remington's
Pharmaceutical Sciences (Mack Pub. Co., N.J. 1991).

Delivery Methods. Once formulated, the compositions of the invention can be
(1) administered directly to the subject (e.g., as polynucleotide or polypeptides); or (2) delivered ex
vivo, to cells derived from the subject (e.g., as in ex vivo gene therapy). Direct delivery of the
compositions will generally be accomplished by parenteral injection, e.g., subcutaneously,
intraperitoneally, intravenously or intramuscularly, intratumoral or to the interstitial space of a
tissue. Other modes of administration include oral and pulmonary administration, suppositories,
and transdermal applications, needles, and gene guns or hyposprays. Dosage treatment can be a
single dose schedule or a multiple dose schedule.

Methods for the ex vivo delivery and reimplantation of transformed cells into a subject are
known in the art and described in, e.g., International Publication No. WO 93/14778. Examples of
cells useful in ex vivo applications include, for example, stem cells, particularly hematopoietic,
lymph cells, macrophages, dendritic cells, or tumor cells. Generally, delivery of nucleic acids for
both ex vivo and in vitro applications can be accomplished by, for example, dextran-mediated
transfection, calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion,
electroporation, encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of
the DNA into nuclei, all well known in the art.

Once a gene corresponding to a polynucleotide of the invention has been found to correlate 
with a proliferative disorder, such as neoplasia, dysplasia, and hyperplasia, the disorder can be 
amenable to treatment by administration of a therapeutic agent based on the provided 
polynucleotide, corresponding polypeptide or other corresponding molecule (e.g., antisense,
ribozyme, etc.).

The dose and the means of administration of the inventive pharmaceutical compositions are
determined based on the specific qualities of the therapeutic composition, the condition, age, and
weight of the patient, the progression of the disease, and other relevant factors. For example,
administration of polynucleotide therapeutic compositions of the invention includes local or
systemic administration, including injection, oral administration, particle gun or catheterized
administration, and topical administration. Preferably, the therapeutic polynucleotide composition
contains an expression construct comprising a promoter operably linked to a polynucleotide of at
least 12, 22, 25, 30, or 35 contiguous nt of the polynucleotide disclosed herein. Various methods
can be used to administer the therapeutic composition directly to a specific site in the body. For
example, a small metastatic lesion is located and the therapeutic composition injected several times
in several different locations within the body of the tumor. Alternatively, arteries which serve a tumor
are identified, and the therapeutic composition injected into such an artery, in order to deliver the
composition directly into the tumor. A tumor that has a necrotic center is aspirated and the
composition injected directly into the now empty center of the tumor. The antisense composition is
directly administered to the surface of the tumor, for example, by topical application of the
composition. X-ray imaging is used to assist in certain of the above delivery methods.

Receptor-mediated targeted delivery of therapeutic compositions containing an antisense
polynucleotide, subgenomic polynucleotides, or antibodies to specific tissues can also be used.
Receptor-mediated DNA delivery techniques are described in, for example, Findeis et al., Trends
Biotechnol. (1993) 11:202; Chiou et al., Gene Therapeutics: Methods And Applications Of Direct
Gene Transfer (J.A. Wolff, ed.) (1994); Wu et al., J. Biol. Chem. (1988) 263:621; Wu et al., J. Biol.
Chem. (1994) 269:542; Zenke et al., Proc. Natl. Acad. Sci. (USA) (1990) 87:3655; Wu et al., J.
Biol. Chem. (1991) 266:338. Therapeutic compositions containing a polynucleotide are
administered in a range of about 100 ng to about 200 mg of DNA for local administration in a gene
therapy protocol. Concentration ranges of about 500 ng to about 50 mg, about 1 µg to about 2 mg,
about 5 µg to about 500 µg, and about 20 µg to about 100 µg of DNA can also be used during a
gene therapy protocol. Factors such as method of action (e.g., for enhancing or inhibiting levels of
the encoded gene product) and efficacy of transformation and expression are considerations which
will affect the dosage required for ultimate efficacy of the antisense subgenomic polynucleotides.
Where greater expression is desired over a larger area of tissue, larger amounts of antisense
subgenomic polynucleotides or the same amounts readministered in a successive protocol of
administrations, or several administrations to different adjacent or close tissue portions of, for
example, a tumor site, may be required to effect a positive therapeutic outcome. In all cases,
routine experimentation in clinical trials will determine specific ranges for optimal therapeutic
effect. For polynucleotide related genes encoding polypeptides or proteins with anti-inflammatory
activity, suitable use, doses, and administration are described in USPN 5,654,173.

The therapeutic polynucleotides and polypeptides of the present invention can be delivered
using gene delivery vehicles. The gene delivery vehicle can be of viral or non-viral origin (see
generally, Jolly, Cancer Gene Therapy (1994) 1:51; Kimura, Human Gene Therapy (1994) 5:845;
Connelly, Human Gene Therapy (1995) 1:185; and Kaplitt, Nature Genetics (1994) 6:148).
Expression of such coding sequences can be induced using endogenous mammalian or heterologous
promoters. Expression of the coding sequence can be either constitutive or regulated.

Viral-based vectors for delivery of a desired polynucleotide and expression in a desired cell
are well known in the art. Exemplary viral-based vehicles include, but are not limited to,
recombinant retroviruses (see, e.g., WO 90/07936; WO 94/03622; WO 93/25698; WO 93/25234;
USPN 5,219,740; WO 93/11230; WO 93/10218; USPN 4,777,127; GB Patent No. 2,200,651; EP 0
345 242; and WO 91/02805), alphavirus-based vectors (e.g., Sindbis virus vectors, Semliki forest
virus (ATCC VR-67; ATCC VR-1247), Ross River virus (ATCC VR-373; ATCC VR-1246) and
Venezuelan equine encephalitis virus (ATCC VR-923; ATCC VR-1250; ATCC VR-1249; ATCC
VR-532)), and adeno-associated virus (AAV) vectors (see, e.g., WO 94/12649; WO 93/03769; WO
93/19191; WO 94/28938; WO 95/11984 and WO 95/00655). Administration of DNA linked to
killed adenovirus as described in Curiel, Hum. Gene Ther. (1992) 3:147 can also be employed.

Non-viral delivery vehicles and methods can also be employed, including, but not limited
to, polycationic condensed DNA linked or unlinked to killed adenovirus alone (see, e.g., Curiel,
Hum. Gene Ther. (1992) 3:147); ligand-linked DNA (see, e.g., Wu, J. Biol. Chem. (1989)
264:16985); eukaryotic cell delivery vehicle cells (see, e.g., USPN 5,814,482; WO 95/07994;
WO 96/17072; WO 95/30763; and WO 97/42338) and nucleic charge neutralization or fusion with
cell membranes. Naked DNA can also be employed. Exemplary naked DNA introduction methods
are described in WO 90/11092 and USPN 5,580,859. Liposomes that can act as gene delivery
vehicles are described in USPN 5,422,120; WO 95/13796; WO 94/23697; WO 91/14445; and
EP 0524968. Additional approaches are described in Philip, Mol. Cell Biol. (1994) 14:2411, and in
Woffendin, Proc. Natl. Acad. Sci. (1994) 91:1581.

Further non-viral delivery suitable for use includes mechanical delivery systems such as the
approach described in Woffendin et al., Proc. Natl. Acad. Sci. USA (1994) 91(24):11581.
Moreover, the coding sequence and the product of expression of such can be delivered through
deposition of photopolymerized hydrogel materials or use of ionizing radiation (see, e.g., USPN
5,206,152 and WO 92/11033). Other conventional methods for gene delivery that can be used for
delivery of the coding sequence include, for example, use of hand-held gene transfer particle gun
(see, e.g., USPN 5,149,655); use of ionizing radiation for activating transferred gene (see, e.g.,
USPN 5,206,152 and WO 92/11033).

The present invention will now be illustrated by reference to the following examples which 
set forth particularly advantageous embodiments. However, it should be noted that these 
embodiments are illustrative and are not to be construed as restricting the invention in any way. 

EXAMPLES 

The following examples are offered primarily for purposes of illustration. It will be readily 
apparent to those skilled in the art that the formulations, dosages, methods of administration, and 
other parameters of this invention may be further modified or substituted in various ways without 
departing from the spirit and scope of the invention. 

Example 1: Source of Biological Materials and Overview of Novel Polynucleotides Expressed 
by the Biological Materials 

cDNA libraries were constructed from mRNA isolated from the GRRpz and WOca cells,
which were provided by Dr. Donna M. Peehl, Department of Medicine, Stanford University School 
of Medicine. GRRpz cells were primary cells derived from normal prostate epithelium. The WOca 
cells were prostate epithelial cells derived from prostate cancer Gleason Grade 4+4. 
Polynucleotides expressed by these cells were isolated and analyzed; the sequences of these 
polynucleotides were about 275-300 nucleotides in length. 

The sequences of the isolated polynucleotides were first masked to eliminate low
complexity sequences using the XBLAST masking program (Claverie "Effective Large-Scale
Sequence Similarity Searches," In: Computer Methods for Macromolecular Sequence Analysis,
Doolittle, ed., Meth. Enzymol. 266:212-227 Academic Press, NY, NY (1996); see particularly
Claverie, in "Automated DNA Sequencing and Analysis Techniques" Adams et al., eds., Chap. 36,
p. 267 Academic Press, San Diego, 1994 and Claverie et al., Comput. Chem. (1993) 17:191).
Generally, masking does not influence the final search results, except to eliminate sequences of
relatively little interest due to their low complexity, and to eliminate multiple "hits" based on
similarity to repetitive regions common to multiple sequences, e.g., Alu repeats. The remaining
sequences were then used in a BLASTN vs. GenBank search; sequences that exhibited greater than
70% overlap, 99% identity, and a p value of less than 1 x 10^-40 were discarded. Sequences from this
search also were discarded if the inclusive parameters were met, but the sequence was ribosomal or
vector-derived.
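
By way of illustration only, the threshold-based discard rule applied after the BLASTN vs. GenBank search can be sketched in Python as follows. The class name Hit, the function name is_discarded_after_genbank_search, and the field names are hypothetical and are not part of the original procedure. The strict "greater than" reading of the identity threshold is an assumption, and the handling of ribosomal or vector-derived sequences is not modeled because the text does not describe it in sufficient detail.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Hit:
    """Best database hit for one query sequence (hypothetical record layout)."""
    overlap_pct: float   # percent of the query covered by the alignment
    identity_pct: float  # percent identity within the alignment
    p_value: float       # BLAST p value of the alignment


def is_discarded_after_genbank_search(
    hit: Optional[Hit],
    min_overlap_pct: float = 70.0,
    min_identity_pct: float = 99.0,
    max_p_value: float = 1e-40,
) -> bool:
    """Return True when the query matches a known GenBank entry closely
    enough to be discarded under the thresholds stated in the text."""
    if hit is None:
        # No hit at all: the sequence is retained for further screening.
        return False
    return (
        hit.overlap_pct > min_overlap_pct
        and hit.identity_pct > min_identity_pct
        and hit.p_value < max_p_value
    )
```

The same function could be reused for the later EST and Patent GeneSeq screens by passing the thresholds recited for those searches as arguments.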

The resulting sequences from the previous search were classified into three groups (1, 2
and 3 below) and searched in a BLASTX vs. NRP (non-redundant proteins) database search: (1)
unknown (no hits in the GenBank search), (2) weak similarity (greater than 45% identity and p
value of less than 1 x 10^-5), and (3) high similarity (greater than 60% overlap, greater than 80%
identity, and p value less than 1 x 10^-5). Sequences having greater than 70% overlap, greater than
99% identity, and p value of less than 1 x 10^-40 were discarded.

The remaining sequences were classified as unknown (no hits), weak similarity, and high
similarity (parameters as above). Two searches were performed on these sequences. First, a
BLAST vs. EST database search was performed and sequences with greater than 99% overlap,
greater than 99% similarity and a p value of less than 1 x 10^-40 were discarded. Sequences with a p
value of less than 1 x 10^-65 when compared to a database sequence of human origin were also
excluded. Second, a BLASTN vs. Patent GeneSeq database search was performed and sequences having
greater than 99% identity, p value less than 1 x 10^-40, and greater than 99% overlap were discarded.

The remaining sequences were subjected to screening using other rules and redundancies in
the dataset. Sequences with a p value of less than 1 x 10^-111 in relation to a database sequence of
human origin were specifically excluded. The final result provided the 316 sequences listed as
SEQ ID NOS:1-316 in the accompanying Sequence Listing and summarized in Table 1 (inserted
prior to claims). Each identified polynucleotide represents sequence from at least a partial mRNA
transcript. Many of the sequences include the sequence ggcacgag at the 5' end; this sequence is a
sequencing artifact and not part of the sequence of the polynucleotides of the invention.

Table 1 provides: 1) the SEQ ID NO ("SEQ ID") assigned to each sequence for use in the
present specification; 2) the Cluster Identification No. ("CLUSTER"); 3) the sequence name ("SEQ
NAME") used as an internal identifier of the sequence; 4) the orientation of the sequence
("ORIENT"); 5) the name assigned to the clone from which the sequence was isolated ("CLONE
ID"); and 6) the name of the library from which the sequence was isolated ("LIBRARY"). CH22PRC
indicates the sequence was isolated from Library 22; CH21PRN indicates the sequence was isolated
from Library 21. A description of the libraries is provided in Table 3 below. Because the provided
polynucleotides represent partial mRNA transcripts, two or more polynucleotides of the invention
may represent different regions of the same mRNA transcript and the same gene. Thus, if two or
more SEQ ID NOS: are identified as belonging to the same clone, then either sequence can be used
to obtain the full-length mRNA or gene.

Example 2: Results of Public Database Search to Identify Function of Gene Products

SEQ ID NOS:1-316 were translated in all three reading frames, and the nucleotide
sequences and translated amino acid sequences used as query sequences to search for homologous
sequences in either the GenBank (nucleotide sequences) or Non-Redundant Protein (amino acid
sequences) databases. Query and individual sequences were aligned using the BLAST 2.0
programs, available over the world wide web at a site sponsored by the National Center for
Biotechnology Information, which is supported by the National Library of Medicine and the
National Institutes of Health (see also Altschul, et al. Nucleic Acids Res. (1997) 25:3389-3402).
The sequences were masked to various extents to prevent searching of repetitive sequences or
poly-A sequences, using the XBLAST program for masking low complexity as described above in
Example 1.

Table 2 (inserted before the claims) provides the alignment summaries having a p value of 1
x 10^-2 or less indicating substantial homology between the sequences of the present invention and
those of the indicated public databases. Specifically, Table 2 provides the SEQ ID NO of the query
sequence, the accession number of the GenBank database entry of the homologous sequence, and
the p value of the alignment. Table 2 also provides the SEQ ID NO of the query sequence, the
accession number of the Non-Redundant Protein database entry of the homologous sequence, and
the p value of the alignment. The alignments provided in Table 2 are the best available alignment
to a DNA or amino acid sequence at a time just prior to filing of the present specification. The
activity of the polypeptide encoded by the SEQ ID NOS listed in Table 2 can be extrapolated to be
substantially the same or substantially similar to the activity of the reported nearest neighbor or
closely related sequence. The accession number of the nearest neighbor is reported, providing a
publicly available reference to the activities and functions exhibited by the nearest neighbor. The
public information regarding the activities and functions of each of the nearest neighbor sequences
is incorporated by reference in this application. Also incorporated by reference is all publicly
available information regarding the sequence, as well as the putative and actual activities and
functions of the nearest neighbor sequences listed in Table 2 and their related sequences. The
search program and database used for the alignment, as well as the calculation of the p value are
also indicated.

Full length sequences or fragments of the polynucleotide sequences of the nearest
neighbors can be used as probes and primers to identify and isolate the full length sequence of the
corresponding polynucleotide. The nearest neighbors can indicate a tissue or cell type to be used to
construct a library for the full-length sequences of the corresponding polynucleotides.

Example 3: Differential Expression of Polynucleotides of the Invention: Description of
Libraries and Detection of Differential Expression

The relative expression levels of the polynucleotides of the invention were assessed in
several libraries prepared from various sources, including primary cells, cell lines and patient tissue
samples. Table 3 provides a summary of these libraries, including the shortened library name (used
hereafter), the mRNA source used to prepare the cDNA library, the "nickname" of the library that
is used in the tables below (in quotes), and the approximate number of clones in the library.



Table 3. Description of cDNA Libraries

Library  Description                                                    Number of Clones
(Lib#)                                                                  in Library

1        Human Colon Cell Line Km12 L4: High Metastatic Potential       308731
         (derived from Km12C)
2        Human Colon Cell Line Km12C: Low Metastatic Potential          284771
3        Human Breast Cancer Cell Line MDA-MB-231: High Metastatic      326937
         Potential; micro-mets in lung
4        Human Breast Cancer Cell Line MCF7: Non Metastatic             318979
8        Human Lung Cancer Cell Line MV-522: High Metastatic Potential  223620
9        Human Lung Cancer Cell Line UCP-3: Low Metastatic Potential    312503
12       Human microvascular endothelial cells (HMVEC) - UNTREATED      41938
         (PCR (OligodT) cDNA library)
13       Human microvascular endothelial cells (HMVEC) - bFGF TREATED   42100
         (PCR (OligodT) cDNA library)
14       Human microvascular endothelial cells (HMVEC) - VEGF TREATED   42825
         (PCR (OligodT) cDNA library)
15       Normal Colon - UC#2 Patient                                    282722
         (MICRODISSECTED PCR (OligodT) cDNA library)
16       Colon Tumor - UC#2 Patient                                     298831
         (MICRODISSECTED PCR (OligodT) cDNA library)
17       Liver Metastasis from Colon Tumor of UC#2 Patient              303467
         (MICRODISSECTED PCR (OligodT) cDNA library)
18       Normal Colon - UC#3 Patient                                    36216
         (MICRODISSECTED PCR (OligodT) cDNA library)
19       Colon Tumor - UC#3 Patient                                     41388
         (MICRODISSECTED PCR (OligodT) cDNA library)
20       Liver Metastasis from Colon Tumor of UC#3 Patient              30956
         (MICRODISSECTED PCR (OligodT) cDNA library)
21       GRRpz Cells derived from normal prostate epithelium            164801
22       WOca Cells derived from Gleason Grade 4 prostate cancer        162088
         epithelium
23       Normal Lung Epithelium of Patient #1006                        306198
         (MICRODISSECTED PCR (OligodT) cDNA library)
24       Primary tumor, Large Cell Carcinoma of Patient #1006           309349
         (MICRODISSECTED PCR (OligodT) cDNA library)



The KM12L4 cell line is derived from the KM12C cell line (Morikawa, et al., Cancer
Research (1988) 48:6863). The KM12C cell line, which is poorly metastatic (low metastatic), was
established in culture from a Dukes' stage B2 surgical specimen (Morikawa et al., Cancer Res.
(1988) 48:6863). The KM12L4-A is a highly metastatic subline derived from KM12C (Yeatman et
al., Nucl. Acids. Res. (1995) 23:4007; Bao-Ling et al., Proc. Annu. Meet. Am. Assoc. Cancer. Res.
(1995) 27:3269). The KM12C and KM12C-derived cell lines (e.g., KM12L4, KM12L4-A, etc.) are
well-recognized in the art as a model cell line for the study of colon cancer (see, e.g., Morikawa et
al., supra; Radinsky et al., Clin. Cancer Res. (1995) 1:19; Yeatman et al., (1995) supra; Yeatman et
al., Clin. Exp. Metastasis (1996) 14:246). The MDA-MB-231 cell line (Brinkley et al., Cancer Res.
(1980) 40:3118-3129) was originally isolated from pleural effusions (Cailleau, J. Natl. Cancer.
Inst. (1974) 53:661), is of high metastatic potential, and forms poorly differentiated
adenocarcinoma grade II in nude mice consistent with breast carcinoma.

The MCF7 cell line was derived from a pleural effusion of a breast adenocarcinoma and is
non-metastatic. The MV-522 cell line is derived from a human lung carcinoma and is of high
metastatic potential. The UCP-3 cell line is a low metastatic human lung carcinoma cell line; the
MV-522 is a high metastatic variant of UCP-3. These cell lines are well-recognized in the art as
models for the study of human breast and lung cancer (see, e.g., Chandrasekaran et al., Cancer Res.
(1979) 39:870 (MDA-MB-231 and MCF-7); Gastpar et al., J Med Chem (1998) 41:4965
(MDA-MB-231 and MCF-7); Ranson et al., Br J Cancer (1998) 77:1586 (MDA-MB-231 and MCF-7);
Kuang et al., Nucleic Acids Res (1998) 26:1116 (MDA-MB-231 and MCF-7); Varki et al., Int J
Cancer (1987) 40:46 (UCP-3); Varki et al., Tumour Biol (1990) 11:327 (MV-522 and UCP-3);
Varki et al., Anticancer Res. (1990) 10:637 (MV-522); Kelner et al., Anticancer Res (1995) 15:867
(MV-522); and Zhang et al., Anticancer Drugs (1997) 8:696 (MV522)). The samples of libraries
15-20 are derived from two different patients (UC#2 and UC#3). The bFGF-treated HMVEC were
prepared by incubation with bFGF at 10 ng/ml for 2 hrs; the VEGF-treated HMVEC were prepared
by incubation with 20 ng/ml VEGF for 2 hrs. Following incubation with the respective growth
factor, the cells were washed and lysis buffer added for RNA preparation. The GRRpz and WOca
cells were provided by Dr. Donna M. Peehl, Department of Medicine, Stanford University School
of Medicine. GRRpz cells were derived from normal prostate epithelium. The WOca cells are a
Gleason Grade 4 cell line.

Each of the libraries is composed of a collection of cDNA clones that in turn are
representative of the mRNAs expressed in the indicated mRNA source. In order to facilitate the
analysis of the millions of sequences in each library, the sequences were assigned to clusters. The
concept of "cluster of clones" is derived from a sorting/grouping of cDNA clones based on their
hybridization pattern to a panel of roughly 300 7 bp oligonucleotide probes (see Drmanac et al.,
Genomics (1996) 37(1):29). Random cDNA clones from a tissue library are hybridized at moderate
stringency to 300 7 bp oligonucleotides. Each oligonucleotide has some measure of specific
hybridization to that specific clone. The combination of 300 of these measures of hybridization for
300 probes equals the "hybridization signature" for a specific clone. Clones with similar sequence
will have similar hybridization signatures. By developing a sorting/grouping algorithm to analyze
these signatures, groups of clones in a library can be identified and brought together
computationally. These groups of clones are termed "clusters". Depending on the stringency of the
selection in the algorithm (similar to the stringency of hybridization in a classic library cDNA
screening protocol), the "purity" of each cluster can be controlled. For example, artifacts of
clustering may occur in computational clustering just as artifacts can occur in "wet-lab" screening
of a cDNA library with 400 bp cDNA fragments, at even the highest stringency. The stringency
used in the implementation of clustering herein provides groups of clones that are in general from the
same cDNA or closely related cDNAs. Closely related clones can be a result of different length
clones of the same cDNA, closely related clones from highly related gene families, or splice
variants of the same cDNA.
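
The grouping of clones by hybridization signature can be illustrated with the following minimal Python sketch. It is offered under stated assumptions and is not the sorting/grouping algorithm actually used, which is not disclosed here: the greedy single-pass strategy, the use of Pearson correlation as the similarity measure, and the names correlation, cluster_signatures, and stringency are all hypothetical.

```python
import math
from typing import Dict, List, Sequence


def correlation(a: Sequence[float], b: Sequence[float]) -> float:
    """Pearson correlation between two equal-length hybridization signatures."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a flat signature carries no pattern to compare
    return cov / math.sqrt(var_a * var_b)


def cluster_signatures(
    signatures: Dict[str, Sequence[float]], stringency: float = 0.9
) -> List[List[str]]:
    """Greedily assign each clone to the first cluster whose representative
    (its first member) correlates with it at or above `stringency`;
    otherwise start a new cluster."""
    clusters: List[List[str]] = []
    for clone_id, signature in signatures.items():
        for members in clusters:
            representative = signatures[members[0]]
            if correlation(signature, representative) >= stringency:
                members.append(clone_id)
                break
        else:
            clusters.append([clone_id])
    return clusters
```

In this sketch, raising the stringency parameter yields smaller, "purer" clusters, while lowering it merges more distantly related clones, which mirrors the trade-off described above.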

Differential expression for a selected cluster was assessed by first determining the number
of cDNA clones corresponding to the selected cluster in the first library (Clones in 1st), and then
determining the number of cDNA clones corresponding to the selected cluster in the second library
(Clones in 2nd). Differential expression of the selected cluster in the first library relative to the
second library is expressed as a "ratio" of percent expression between the two libraries. In general,
the "ratio" is calculated by: 1) calculating the percent expression of the selected cluster in the first
library by dividing the number of clones corresponding to a selected cluster in the first library by
the total number of clones analyzed from the first library; 2) calculating the percent expression of
the selected cluster in the second library by dividing the number of clones corresponding to a
selected cluster in a second library by the total number of clones analyzed from the second library;
3) dividing the calculated percent expression from the first library by the calculated percent
expression from the second library. If the "number of clones" corresponding to a selected cluster in
a library is zero, the value is set at 1 to aid in calculation. The formula used in calculating the ratio
takes into account the "depth" of each of the libraries being compared, i.e., the total number of
clones analyzed in each library.
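
The three-step ratio calculation described above can be sketched in Python as follows. The function names percent_expression and expression_ratio are hypothetical, and the sketch assumes that the total number of clones analyzed from each library is supplied by the caller.

```python
def percent_expression(cluster_clones: int, total_clones: int) -> float:
    """Percent of a library's analyzed clones that belong to the cluster.
    A count of zero is set to 1, as stated in the text, to aid calculation."""
    if total_clones <= 0:
        raise ValueError("total_clones must be positive")
    adjusted = cluster_clones if cluster_clones > 0 else 1
    return 100.0 * adjusted / total_clones


def expression_ratio(
    clones_first: int, total_first: int, clones_second: int, total_second: int
) -> float:
    """Ratio of percent expression in the first library to that in the second."""
    return percent_expression(clones_first, total_first) / percent_expression(
        clones_second, total_second
    )
```

As a usage example, taking the 317 clones reported for Library 4 and the 3 clones reported for Library 3 in Example 4, and assuming that the number of clones analyzed equals the library sizes listed in Table 3 (318979 and 326937), expression_ratio(317, 318979, 3, 326937) evaluates to roughly 108, in line with the approximately 100-fold difference reported there.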

In general, a polynucleotide is said to be significantly differentially expressed between two
samples when the ratio value is greater than at least about 2, preferably greater than at least about 3,
more preferably greater than at least about 5, where the ratio value is calculated using the method
described above. The significance of differential expression is determined using a z score test (Zar,
Biostatistical Analysis, Prentice Hall, Inc., USA, "Differences between Proportions," pp 296-298
(1974)).
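
A z score for the difference between two proportions can be computed as in the following Python sketch. It uses the common pooled-variance form without continuity correction, which is an assumption; the exact variant of the test applied is not stated in the text, and the function names are hypothetical.

```python
import math


def two_proportion_z(
    clones_first: int, total_first: int, clones_second: int, total_second: int
) -> float:
    """Pooled two-proportion z score for cluster abundance in two libraries."""
    p_first = clones_first / total_first
    p_second = clones_second / total_second
    pooled = (clones_first + clones_second) / (total_first + total_second)
    std_err = math.sqrt(
        pooled * (1.0 - pooled) * (1.0 / total_first + 1.0 / total_second)
    )
    if std_err == 0.0:
        raise ValueError("z score is undefined when the pooled proportion is 0 or 1")
    return (p_first - p_second) / std_err


def two_sided_p_value(z: float) -> float:
    """Two-sided p value for a z score under the standard normal distribution."""
    return math.erfc(abs(z) / math.sqrt(2.0))
```

A large absolute z score (and a correspondingly small p value) indicates that the difference in clone counts is unlikely to arise from sampling depth alone.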

Using this approach, a number of polynucleotide sequences were identified as being
differentially expressed between, for example, cells derived from high metastatic potential cancer
tissue and low metastatic cancer cells, and between cells derived from metastatic cancer tissue and
normal tissue. Evaluation of the levels of expression of the genes corresponding to these sequences
can be valuable in diagnosis, prognosis, and/or treatment (e.g., to facilitate rational design of
therapy, monitoring during and after therapy, etc.). Moreover, the genes corresponding to
differentially expressed sequences described herein can be therapeutic targets due to their
involvement in regulation (e.g., inhibition or promotion) of development of, for example, the
metastatic phenotype. For example, sequences that correspond to genes that are increased in
expression in high metastatic potential cells relative to normal or non-metastatic tumor cells may
encode genes or regulatory sequences involved in processes such as angiogenesis, differentiation,
cell replication, and metastasis.

Detection of the relative expression levels of differentially expressed polynucleotides
described herein can provide valuable information to guide the clinician in the choice of therapy.
For example, a patient sample exhibiting an expression level of one or more of these
polynucleotides that corresponds to a gene that is increased in expression in metastatic or high
metastatic potential cells may warrant more aggressive treatment for the patient. In contrast,
detection of expression levels of a polynucleotide sequence that corresponds to expression levels
associated with that of low metastatic potential cells may warrant a more positive prognosis than
the gross pathology would suggest.

The differential expression of the polynucleotides described herein can thus be used as, for
example, diagnostic markers, prognostic markers, for risk assessment, patient treatment and the
like. These polynucleotide sequences can also be used in combination with other known molecular
and/or biochemical markers.

The differential expression data for polynucleotides of the invention that have been
identified as being differentially expressed across various combinations of the libraries described
above are summarized in Table 4 (inserted prior to the claims). Table 4 provides: 1) the Sequence
Identification Number ("SEQ ID") assigned to the polynucleotide; 2) the cluster ("CLUST") to
which the polynucleotide has been assigned as described above; 3) the library comparisons that
resulted in identification of the polynucleotide as being differentially expressed ("PairAB-text"),
with shorthand names of the compared libraries provided in parentheses following the library
numbers; 4) the number of clones corresponding to the polynucleotide in the first library listed
("A"); 5) the number of clones corresponding to the polynucleotide in the second library listed
("B"); 6) the "RATIO PLUS" where the comparison resulted in a finding that the number of clones
in library A is greater than the number of clones in library B; and 7) the "RATIO MINUS" where
the comparison resulted in a finding that the number of clones in library B is greater than the
number of clones in library A.

Example 4: Differential Expression of Polynucleotides Associated with Metastatic Potential in
Breast Cancer

Differential expression was examined in breast cancer cells having either high metastatic
potential or low metastatic potential. A single cluster, Cluster Identification No. 10154, was
identified as displaying low expression in the high metastatic potential breast cancer cells (Library
3), and significantly increased expression (approximately 100-fold higher) in the low metastatic
potential cells (Library 4). Specifically, three clones were identified that were expressed in Library
3, the high metastatic potential breast cancer library, while 317 clones were expressed in Library 4,
the low metastatic potential breast cancer library. The two sequences assigned to this particular
cluster, SEQ ID NO:315 and SEQ ID NO:316, both displayed this differential expression,
suggesting that the two sequences are likely associated with a single transcript.
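The "approximately 100-fold" figure can be checked directly from the clone counts reported above. The following is a minimal sketch, assuming a plain ratio of raw clone counts with no correction for library size; the function name is hypothetical and is not taken from the specification.

```python
def fold_difference(clones_a: int, clones_b: int) -> float:
    """Return how many times more clones library B contains than library A.

    Assumption: raw clone counts are compared directly, with no
    normalization for the total number of clones in each library.
    """
    if clones_a <= 0:
        raise ValueError("library A must contain at least one clone")
    return clones_b / clones_a


# Clone counts reported for Cluster 10154
library3_high_metastatic = 3
library4_low_metastatic = 317

fold = fold_difference(library3_high_metastatic, library4_low_metastatic)
print(f"{fold:.1f}-fold")  # prints "105.7-fold"
```

A ratio of about 106 is consistent with the "approximately 100-fold" description in the text.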

SEQ ID NO:315 and SEQ ID NO:316 were then used as query sequences to search for
homologous sequences in GenBank as described in Examples 1 and 2. SEQ ID NO:315 displayed
identity to the GenBank entry H72034 (SEQ ID NO:317) and SEQ ID NO:316 displayed identity to
GenBank entry AA707002 (SEQ ID NO:318). SEQ ID NO:315 displays striking identity to the 3'
end of SEQ ID NO:317 (See Figures 1A and 1B), while SEQ ID NO:316 displays striking identity
to the 5' end of SEQ ID NO:318 (See Figure 2). Clones of H72034 and AA707002 were ordered
from the I.M.A.G.E. Consortium at the Lawrence Livermore National Laboratories (Livermore,
California) for further studies.

Restriction Mapping of Clones H72034 and AA707002 

The newly identified sequences were digested with a number of different restriction
endonucleases to construct a restriction map of each of the clones. An appropriate amount of each
clone, SEQ ID NO:317 or SEQ ID NO:318, was digested with various enzymes, and the restriction
fragments identified as follows:

SEQ ID NO:317

Enzyme    #Cuts   Positions
AluI      5       331, 1029, 1422, 1595, 1977
BamHI     2       1836, 2089
BstEII    1       936
BstXI     1       1033
HaeIII    12      145, 300, 453, 497, 582, 780,
                  1102, 1536, 1561, 1722, 1981, 2062
HinfI     12      5, 154, 205, 325, 397, 473, 968,
                  1295, 1426, 2066
KpnI      1       1938
MspI      6       78, 739, 1098, 2038, 2077, 2093
NcoI      2       2013, 2058
PstI      1       1501
PvuII     2       331, 1422
Sau3AI    6       1270, 1813, 1819, 1836, 1894, 2089
SphI      1       1870
XhoI      1       1413
SEQ ID NO:318

Enzyme    #Cuts   Positions
AluI      9       19, 245, 367, 553, 586, 874, 1214
BamHI     1       407
BglI      1       1056
BglII     1       475
BstEII    1       1108
HaeIII    10      153, 348, 485, 867, 518, 628, 915,
                  1016, 1312
HindIII   2       243, 872
HinfI     1       1353
KpnI      1       132
MspI      2       1196, 1261
PstI      1       823
PvuII     1       996
Sau3AI    7       66, 407, 475, 504, 750, 850,
                  610, 820



The restriction maps based on the identified sites can be used to determine the position of
each clone relative to the genomic sequences, and to confirm the 5'-3' orientation of the clones.
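To compare a restriction map with bands on a gel, the cut positions have to be converted into expected fragment sizes. The sketch below illustrates that arithmetic for a linear molecule. It assumes each listed position marks the last residue before the cut, and it uses a hypothetical insert length of 2100 residues, because the length of the clones is not stated in this section.

```python
def fragment_lengths(cut_positions, sequence_length):
    """Return fragment sizes, 5' to 3', for a linear molecule.

    Assumption: each position is the 1-based index of the last residue
    before the cut, so a cut at position p leaves p residues upstream.
    """
    cuts = sorted(set(cut_positions))
    if any(p <= 0 or p >= sequence_length for p in cuts):
        raise ValueError("cut positions must lie inside the sequence")
    boundaries = [0] + cuts + [sequence_length]
    return [end - start for start, end in zip(boundaries, boundaries[1:])]


ASSUMED_LENGTH = 2100  # hypothetical length, for illustration only

# BamHI positions listed for SEQ ID NO:317
bamhi_sites = [1836, 2089]
print(fragment_lengths(bamhi_sites, ASSUMED_LENGTH))  # [1836, 253, 11]

# Double digest: BamHI plus the single KpnI site at 1938
print(fragment_lengths(bamhi_sites + [1938], ASSUMED_LENGTH))  # [1836, 102, 151, 11]
```

Only the internal fragments (253 residues for BamHI alone) are independent of the assumed length; the terminal fragment sizes depend on it and on how much vector sequence flanks the insert.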



Amplification and Purification of Transcript 

A transcript in this region that is upregulated in low metastatic cancers and that contains
sequences from SEQ ID NOS:315-318 is identified using a technique such as polymerase chain
reaction (PCR) amplification. Based on the sequences identified and the original sequences of the
cluster, primers can be designed to isolate the full length cDNA from a library constructed from the
breast cancer cell line with low metastatic potential.

A cDNA template for use in the amplification reaction is generated from total RNA
isolated from the high metastatic breast cell line. RNA is reverse transcribed using oligo-dT primer
to generate first strand cDNA. cDNA is synthesized by denaturing 3 µl of total RNA, 2 µl oligo-dT
primer at 20 µM, and 5 µl DEPC water for 8 minutes at 65°C followed by reverse transcription
at 52°C for 1 hour in a reaction containing the denatured RNA/primer plus 4 µl 5X cDNA buffer
(GibcoBRL), 1 µl 0.1 M dithiothreitol, 1 µl 40 U/µl RNAseOUT (GibcoBRL), 1 µl DEPC water,
2 µl 10 mM dNTP (GibcoBRL), and 1 µl 15 U/µl Thermoscript reverse transcriptase (GibcoBRL).
The reaction was terminated by a 5-min incubation at 85°C, and the RNA was removed by 1 µl
2 U/µl RNAse H at 37°C for thirty minutes.
Based on the determined orientation of the clones, primers are designed to amplify a
full-length clone corresponding to the differentially expressed transcript in this region. Forward
primers that are used to amplify the full-length clone are taken from the 5' end of SEQ ID NO:317
as follows:

F1 5'-TGGGATATAGTCTCGTGGTGCG-3' (SEQ ID NO:319)
F2 5'-TGATTCGATGTCATCAGTCCCG-3' (SEQ ID NO:320)

Primer F1 is taken from residues 51-62 of SEQ ID NO:317, and primer F2 is taken from residues
212-233 of SEQ ID NO:317. Both forward primers are near the 5' end of this sequence.
Reverse primers are designed using sequences complementary to the 3' end of clone
10154-3 as follows:

R1 5'-TGTGTCACAGCCAGACATGAGC-3' (SEQ ID NO:321)
R2 5'-TGCAAACATACACAGGGACCG-3' (SEQ ID NO:322)

Primer R1 is based on residues 573-552 of SEQ ID NO:318, and R2 is based on residues 399-379
of SEQ ID NO:318.

PCR is performed using a 5 µl aliquot of the first strand cDNA synthesis reaction, and a
primer pair, e.g., F1 and R1, F1 and R2, F2 and R1, or F2 and R2. An open reading frame is
amplified using 2 µl of the reverse transcription product as template in a PCR reaction containing
5 µl of 10x PCR buffer (GibcoBRL), 1 µl 50 mM MgSO4, 1 µl 10 mM dNTP, 1 µl F1 or F2 primer,
1 µl R1 primer, 2.5 U High Fidelity Platinum Taq DNA polymerase (GibcoBRL), and water to 50 µl.
The molecule is amplified using 30 rounds of amplification in a thermal cycler at the following
temperatures: 1 minute at 95°C; 1 minute at 55°C and 2 minutes at 72°C. The 30 cycles were
followed by a 10 minute extension at 72°C.

Following amplification of the sequences, the PCR products are loaded on a 1% TAE gel
and subjected to gel purification. One or more bands can be isolated from the gel and the DNA was
purified using a QIAquick® Gel Extraction Kit (Qiagen, Valencia, CA). The purified fragment was
cloned into a bacterial vector and transformed into the bacterial strain DH5α. Following cloning of
the purified fragment(s), the DNA can be isolated and sequenced to confirm that a band
corresponds to a transcript from this genetic region.

The reactions are carried out with two different 5' and 3' primers to increase the likelihood
that the reaction will yield an amplification product. Other primers may also be designed from the
predicted 5' and/or 3' end of the sequence, as will be apparent to one skilled in the art upon reading
this disclosure, and thus other primers may be designed from the general region of SEQ ID
NOS:317 and 318 that may yield better results than the disclosed primers.
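Primers in this section are specified by residue coordinates, with reverse primers written as a descending range (e.g., 573-552) to show that they are the reverse complement of the template. The sketch below shows one way to derive a primer from such coordinates. The helper names and the toy template are hypothetical, and the sequences of SEQ ID NOS:317 and 318 are not reproduced here.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def primer_from_residues(template: str, first: int, last: int) -> str:
    """Extract a primer using 1-based, inclusive residue coordinates.

    An ascending range (first <= last) gives a forward primer copied from
    the template; a descending range gives a reverse primer, i.e. the
    reverse complement of residues last..first.
    """
    low, high = min(first, last), max(first, last)
    if low < 1 or high > len(template):
        raise ValueError("coordinates fall outside the template")
    segment = template[low - 1:high]
    return segment if first <= last else reverse_complement(segment)


# Toy 39-residue template, for illustration only
toy_template = "ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGATAG"

forward = primer_from_residues(toy_template, 1, 10)
reverse = primer_from_residues(toy_template, 39, 30)
print(forward)  # ATGGCCATTG
print(reverse)  # CTATCGGGCA

# Consistency check: primer length should equal the coordinate span
r1 = "TGTGTCACAGCCAGACATGAGC"  # SEQ ID NO:321, residues 573-552
r2 = "TGCAAACATACACAGGGACCG"   # SEQ ID NO:322, residues 399-379
print(len(r1) == 573 - 552 + 1, len(r2) == 399 - 379 + 1)  # True True
```

The same length check is a quick way to catch transcription errors when new primers are designed from other regions of the clones.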

In order to obtain additional sequences 5' to the end of a partial cDNA, 5' rapid
amplification of cDNA ends (RACE) can be performed to ensure that the entire transcript has been
identified. See PCR Protocols: A Guide to Methods and Applications, (1990) Academic Press, Inc.
Following isolation of a cDNA using the F1-R1 or F2-R1 primer pairs, additional primers can be
designed to perform RACE. The primers can be designed from the sequence of 10154-1 as
follows:

5'-TTTAGCAGCACTAATGACTGTGGC-3' (SEQ ID NO:323)
5'-CGCCGTGAATTACTGTGGATGG-3' (SEQ ID NO:324)

The two RACE primers are designed based on residues 286-263 and 396-375 of SEQ ID NO:317,
respectively.

These sequences can be used to obtain any transcript sequences 5' to the amplification products
obtained using the PCR protocol described above.

Northern Analysis

Other techniques can be used for confirming differential expression of the full-length
transcript. For example, a Northern blot can be used to verify differential expression of SEQ ID
NOS:317 and 318 in breast cancer cells with low metastatic potential compared to breast cancer
cells with high metastatic potential. Northern analysis can be accomplished by methods well-known
in the art. Briefly, RNA is individually isolated from breast cancer cells having high metastatic
potential and breast cancer cells having low metastatic potential, e.g., using a product such as
RNeasy Mini Kits (Qiagen, CA) or NucleoSpin® RNA II Kit (Clontech, Palo Alto, CA). For
Northern analysis, RNA isolated from the cells was electrophoresed on a denaturing formaldehyde
agarose gel and transferred onto a membrane such as a supported nitrocellulose membrane
(Schleicher & Schuell).

Rapid-Hyb buffer (Amersham Life Science, Little Chalfont, England) with 5 mg/ml
denatured single stranded sperm DNA is pre-warmed to 65°C and the RNA blots are pre-hybridized
in the buffer with shaking at 65°C for 30 minutes. Gene-specific DNA probes (50 ng per reaction)
labeled with [α-32P]dCTP (3000 Ci/mmol, Amersham Pharmacia Biotech Inc., Piscataway, NJ)
(Prime-It RmT Kit, Stratagene, La Jolla, CA) and purified with ProbeQuant™ G-50 Micro
Columns (Amersham Pharmacia Biotech Inc.) are added and hybridized to the blots with shaking at
65°C overnight. The blots are washed in 2x SSC, 0.1% (w/v) SDS at room temperature for 20
minutes, twice in 1x SSC, 0.1% (w/v) SDS at 65°C for 15 minutes, then exposed to Hyperfilms
(Amersham Life Science).

Example 6: Identification of Differentially Expressed Genes by Array Analysis with Patient Tissue
Samples

Differentially expressed genes corresponding to the polynucleotides described herein were 
also identified by microarray hybridization analysis using materials obtained from patient tissue 
samples. The biological materials used in these experiments are described below. 
Source of patient tissue samples 

Normal and cancerous tissues were collected from patients using laser capture
microdissection (LCM) techniques, which techniques are well known in the art (see, e.g., Ohyama
et al. (2000) Biotechniques 29:530-6; Curran et al. (2000) Mol. Pathol. 53:64-8; Suarez-Quian et
al. (1999) Biotechniques 26:328-35; Simone et al. (1998) Trends Genet. 14:272-6; Conia et al.
(1997) J. Clin. Lab. Anal. 11:28-38; Emmert-Buck et al. (1996) Science 274:998-1001). Table 8
(inserted following the last page of the Examples) provides information about each patient from
which the samples were isolated, including: the Patient ID and Path ReportID, numbers assigned to
the patient and the pathology reports for identification purposes; the anatomical location of the
tumor (AnatomicalLoc); the Primary Tumor Size; the Primary Tumor Grade; the Histopathologic
Grade; a description of local sites to which the tumor had invaded (Local Invasion); the presence of
lymph node metastases (Lymph Node Metastasis); incidence of lymph node metastases (provided
as number of lymph nodes positive for metastasis over the number of lymph nodes examined)
(Incidence Lymphnode Metastasis); the Regional Lymphnode Grade; the identification or detection
of metastases to sites distant to the tumor and their location (Distant Met & Loc); a description of
the distant metastases (Description Distant Met); the grade of distant metastasis (Distant Met
Grade); and general comments about the patient or the tumor (Comments). Adenoma was not
described in any of the patients; adenoma dysplasia (described as hyperplasia by the pathologist)
was described in Patient ID No. 695. Extranodal extensions were described in two patients, Patient
ID Nos. 784 and 791. Lymphovascular invasion was described in seven patients, Patient ID Nos.
128, 278, 517, 534, 784, 786, and 791. Crohn's-like infiltrates were described in seven patients,
Patient ID Nos. 52, 264, 268, 392, 393, 784, and 791.
Source of polynucleotides on arrays

Polynucleotides on arrays

Polynucleotides spotted on the arrays were generated by PCR amplification of clones
derived from cDNA libraries. The clones used for amplification were either the clones from which
the sequences described herein (SEQ ID NOS:1-316) were derived, or are clones having inserts
with significant polynucleotide sequence overlap with the sequences described herein (SEQ ID
NOS:1-316) as determined by BLAST2 homology searching.

Microarray Design 

Each array used in the examples below had an identical spatial layout and control spot set.
Each microarray was divided into two areas, each area having an array with, on each half, twelve
groupings of 32 x 12 spots for a total of about 9,216 spots on each array. The two areas are spotted
identically, which provides for at least two duplicates of each clone per array. Spotting was
accomplished using PCR amplified products from 0.5 kb to 2.0 kb and spotted using a Molecular
Dynamics Gen III spotter according to the manufacturer's recommendations. The first row of each
of the 24 regions on the array had about 32 control spots, including 4 negative control spots and 8
test polynucleotides.
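The spot total quoted above follows from the stated layout. The short sketch below reproduces the arithmetic; the variable names are illustrative only, and the control-spot estimate assumes exactly 32 control spots in each of the 24 regions, whereas the text says "about 32".

```python
# Layout stated in the text: two identically spotted areas,
# twelve groupings per area, 32 x 12 spots per grouping.
areas_per_array = 2
groupings_per_area = 12
spots_per_grouping = 32 * 12

regions = areas_per_array * groupings_per_area
total_spots = regions * spots_per_grouping

# Assumption: exactly 32 control spots in the first row of every region.
control_spots = regions * 32

print(regions)        # 24
print(total_spots)    # 9216
print(control_spots)  # 768
```

Because the two areas are identical, at most half of the 9,216 spots (4,608) can carry distinct clones or controls.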

The test polynucleotides were spiked into each sample before the labeling reaction with a
range of concentrations from 2-600 pg/slide and ratios of 1:1. For each array design, two slides
were hybridized with the test samples reverse-labeled in the labeling reaction. This provided for
about 4 duplicate measurements for each clone, two of one color and two of the other, for each
sample.

Microarray analysis 

cDNA probes were prepared from total RNA isolated from the patient cells described
above (Table 8). Since LCM provides for the isolation of specific cell types to provide a
substantially homogenous cell sample, this provided for a similarly pure RNA sample.

Total RNA was first reverse transcribed into cDNA using a primer containing a T7 RNA
polymerase promoter, followed by second strand DNA synthesis. cDNA was then transcribed in
vitro to produce antisense RNA using the T7 promoter-mediated expression (see, e.g., Luo et al.
(1999) Nature Med 5:117-122), and the antisense RNA was then converted into cDNA. The
second set of cDNAs were again transcribed in vitro, using the T7 promoter, to provide antisense
RNA. Optionally, the RNA was again converted into cDNA, allowing for up to a third round of
T7-mediated amplification to produce more antisense RNA. Thus the procedure provided for two
or three rounds of in vitro transcription to produce the final RNA used for fluorescent labeling.
Fluorescent probes were generated by first adding control RNA to the antisense RNA mix, and
producing fluorescently labeled cDNA from the RNA starting material. Fluorescently labeled
cDNAs prepared from the tumor RNA sample were compared to fluorescently labeled cDNAs
prepared from normal cell RNA sample. For example, the cDNA probes from the normal cells
were labeled with Cy3 fluorescent dye (green) and the cDNA probes prepared from the tumor cells
were labeled with Cy5 fluorescent dye (red).

The differential expression assay was performed by mixing equal amounts of probes from
tumor cells and normal cells of the same patient. The arrays were prehybridized by incubation for
about 2 hrs at 60°C in 5X SSC/0.2% SDS/1 mM EDTA, and then washed three times in water and
twice in isopropanol. Following prehybridization of the array, the probe mixture was then
hybridized to the array under conditions of high stringency (overnight at 42°C in 50% formamide,
5X SSC, and 0.2% SDS). After hybridization, the array was washed at 55°C three times as follows:
1) first wash in 1X SSC/0.2% SDS; 2) second wash in 0.1X SSC/0.2% SDS; and 3) third wash in
0.1X SSC.

The arrays were then scanned for green and red fluorescence using a Molecular Dynamics
Generation III dual color laser-scanner/detector. The images were processed using BioDiscovery
Autogene software, and the data from each scan set normalized to provide for a ratio of expression
relative to normal. Data from the microarray experiments was analyzed according to the algorithms
described in U.S. application serial no. 60/252,358, filed November 20, 2000, by E.J. Moler, M.A.
Boyle, and F.M. Randazzo, and entitled "Precision and accuracy in cDNA microarray data," which
application is specifically incorporated herein by reference.

The experiment was repeated, this time labeling the two probes with the opposite color in
order to perform the assay in both "color directions." Each experiment was sometimes repeated
with two more slides (one in each color direction). The level of fluorescence for each sequence on
the array was expressed as a ratio of the geometric mean of 8 replicate spots/gene from the four
arrays or 4 replicate spots/gene from 2 arrays or some other permutation. The data were normalized
using the spiked positive controls present in each duplicated area, and the precision of this
normalization was included in the final determination of the significance of each differential. The
fluorescent intensity of each spot was also compared to the negative controls in each duplicated
area to determine which spots have detected significant expression levels in each sample.

A statistical analysis of the fluorescent intensities was applied to each set of duplicate spots
to assess the precision and significance of each differential measurement, resulting in a p-value
testing the null hypothesis that there is no differential in the expression level between the tumor
and normal samples of each patient. For initial analysis of the microarrays, the hypothesis was
accepted if p>10^-3, and the differential ratio was set to 1.000 for those spots. All other spots have a
significant difference in expression between the tumor and normal sample. If the tumor sample has
detectable expression and the normal does not, the ratio is truncated at 1000 since the value for
expression in the normal sample would be zero, and the ratio would not be a mathematically useful
value (e.g., infinity). If the normal sample has detectable expression and the tumor does not, the
ratio is truncated to 0.001, since the value for expression in the tumor sample would be zero and the
ratio would not be a mathematically useful value. These latter two situations are referred to herein
as "on/off." Database tables were populated using a 95% confidence level (p>0.05).
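The ratio rules described above (geometric mean of replicate spots, a ratio of 1.000 when the null hypothesis is accepted, and truncation to 1000 or 0.001 for "on/off" spots) can be sketched as follows. This is only an illustration under stated assumptions: the function and constant names are hypothetical, the p-value is taken as an input rather than computed, and the case where neither sample is detected is not addressed in the text, so the sketch returns None for it.

```python
import math

P_ACCEPT_NULL = 1e-3  # null hypothesis accepted when p exceeds this value
ON_RATIO = 1000.0     # tumor detected, normal not detected
OFF_RATIO = 0.001     # normal detected, tumor not detected


def geometric_mean(values):
    """Geometric mean of positive replicate spot intensities."""
    if not values or any(v <= 0 for v in values):
        raise ValueError("geometric mean needs at least one positive value")
    return math.exp(sum(math.log(v) for v in values) / len(values))


def differential_ratio(tumor, normal, p_value):
    """Return the tumor/normal ratio under the rules described in the text.

    tumor and normal are mean intensities, or None when the sample showed
    no expression above the negative controls.
    """
    if tumor is None and normal is None:
        return None  # assumption: this case is not covered by the text
    if p_value > P_ACCEPT_NULL:
        return 1.0   # no significant differential
    if normal is None:
        return ON_RATIO
    if tumor is None:
        return OFF_RATIO
    return tumor / normal


# Hypothetical replicate intensities, for illustration only
tumor_mean = geometric_mean([400.0, 625.0])   # approximately 500
normal_mean = geometric_mean([100.0, 100.0])  # approximately 100

print(round(differential_ratio(tumor_mean, normal_mean, 1e-5), 3))  # 5.0
print(differential_ratio(tumor_mean, normal_mean, 0.01))            # 1.0
print(differential_ratio(tumor_mean, None, 1e-6))                   # 1000.0
```

The geometric mean is the natural average for ratio data because it treats a two-fold increase and a two-fold decrease symmetrically.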

Table 9 below summarizes the results of the differential expression analysis. The table
provides: the SEQ ID NO of the polynucleotide corresponding to the polynucleotide on the spot on
the array; the Spot ID (an identifier assigned to the spot so as to distinguish it from spots on the
same and different arrays); the number of patients for whom there was information obtained from
the array (Num Ratios); and the percentage of patients in which expression was detected at greater
than or equal to a two-fold increase (>=2x), greater than or equal to a five-fold increase (>=5x), or
less than or equal to a 1/2-fold decrease (<=halfx) relative to matched normal control tissue.
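The percentage columns can be derived from per-patient ratios as sketched below. The per-patient ratios are not given in the specification, so the list used here is hypothetical and was chosen only so that the counts (29, 13, and 1 of 33 patients) reproduce the first row of Table 9 (SEQ ID NO:8).

```python
def summarize_ratios(ratios):
    """Summarize tumor/normal ratios as the percentage columns of Table 9."""
    n = len(ratios)
    if n == 0:
        raise ValueError("at least one ratio is required")

    def percent(count):
        return round(100.0 * count / n, 2)

    return {
        "Num Ratios": n,
        ">=2x": percent(sum(r >= 2.0 for r in ratios)),
        ">=5x": percent(sum(r >= 5.0 for r in ratios)),
        "<=halfx": percent(sum(r <= 0.5 for r in ratios)),
    }


# Hypothetical ratios for 33 patients: 13 at 6.0, 16 at 2.5, 1 at 0.4, 3 at 1.0
example_ratios = [6.0] * 13 + [2.5] * 16 + [0.4] + [1.0] * 3

print(summarize_ratios(example_ratios))
# {'Num Ratios': 33, '>=2x': 87.88, '>=5x': 39.39, '<=halfx': 3.03}
```

Note that the >=5x patients are also counted in the >=2x column, so the columns are not mutually exclusive.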

In general, a polynucleotide is said to represent a significantly differentially expressed gene
between two samples when there are detectable levels of expression in at least one sample and the
ratio value is greater than at least about 1.2 fold, preferably greater than at least about 1.5 fold,
more preferably greater than at least about 2 fold, where the ratio value is calculated using the
method described above.

A differential expression ratio of 1 indicates that the expression level of the gene in the 
tumor cell was not statistically different from expression of that gene in normal colon cells of the 
same patient. A differential expression ratio significantly greater than 1 in cancerous colon cells 
relative to normal colon cells indicates that the gene is increased in expression in cancerous cells 
relative to normal cells, indicating that the gene plays a role in the development of the cancerous 
phenotype, and may be involved in promoting metastasis of the cell. Detection of gene products 
from such genes can provide an indicator that the cell is cancerous, and may provide a therapeutic 
and/or diagnostic target. 

Likewise, a differential expression ratio significantly less than 1 in cancerous colon cells
relative to normal colon cells indicates that, for example, the gene is involved in suppression of the
cancerous phenotype. Increasing activity of the gene product encoded by such a gene, or replacing
such activity, can provide the basis for chemotherapy. Such genes can also serve as markers of
cancerous cells, e.g., the absence or decreased presence of the gene product in a colon cell relative
to a normal colon cell indicates that the cell may be cancerous.



Table 9.

SEQ ID NO:   SpotID   Num Ratios   >=2x    >=5x    <=halfx
8            579      33           87.88   39.39   3.03
12           22300    33           33.33   18.18   6.06
26           21886    33           33.33   0.00    3.03
64           9487     33           33.33   12.12   3.03
248          28179    28           32.14   0.00    0.00
253          28179    28           32.14   0.00    0.00
272          28179    28           32.14   0.00    0.00
292          9111     33           33.33   18.18   3.03
295          19980    33           33.33   6.06    0.00
309          23993    33           42.42   3.03    3.03



Deposit Information. The following materials were deposited with the American Type
Culture Collection (CMCC = Chiron Master Culture Collection).

Table 5. Cell Lines Deposited with ATCC

Cell Line     Deposit Date      ATCC Accession No.   CMCC Accession No.
KM12L4-A      March 19, 1998    CRL-12496            11606
KM12C         May 15, 1998      CRL-12533            11611
MDA-MB-231    May 15, 1998      CRL-12532            10583
MCF-7         October 9, 1998   CRL-12584            10377



In addition, pools of selected clones, as well as libraries containing specific clones, were
assigned an "ES" number (internal reference) and deposited with the ATCC. Table 6 below
provides the ATCC Accession Nos. of the ES deposits, all of which were deposited on or before
May 13, 1999. The names of the clones contained within each of these deposits are provided in
Table 7 (inserted before the claims).



Table 6: Pools of Clones and Libraries Deposited with ATCC on or before March 28, 2000

Cell Line   CMCC   ATCC
ES75        5140   PTA-1102
ES76        5141   PTA-1103
ES77        5142   PTA-1104
ES78        5143   PTA-1105
ES79        5144   PTA-1106
ES80        5145   PTA-1107
ES81        5146   PTA-1108
ES82        5147   PTA-1109
ES83        5148   PTA-1110
ES84        5149   PTA-1111

The deposits described herein are provided merely as a convenience to those of skill in the
art, and are not an admission that a deposit is required under 35 U.S.C. §112. The sequence of the
polynucleotides contained within the deposited material, as well as the amino acid sequence of the
polypeptides encoded thereby, are incorporated herein by reference and are controlling in the event
of any conflict with the written description of sequences herein. A license may be required to
make, use, or sell the deposited material, and no such license is granted hereby.

Retrieval of Individual Clones from Deposit of Pooled Clones. Where the ATCC deposit is
composed of a pool of cDNA clones or a library of cDNA clones, the deposit was prepared by first
transfecting each of the clones into separate bacterial cells. The clones in the pool or library were
then deposited as a pool of equal mixtures in the composite deposit. Particular clones can be
obtained from the composite deposit using methods well known in the art. For example, a bacterial
cell containing a particular clone can be identified by isolating single colonies, and identifying
colonies containing the specific clone through standard colony hybridization techniques, using an
oligonucleotide probe or probes designed to specifically hybridize to a sequence of the clone insert
(e.g., a probe based upon unmasked sequence of the encoded polynucleotide having the indicated
SEQ ID NO). The probe should be designed to have a Tm of approximately 80°C (assuming 2°C
for each A or T and 4°C for each G or C). Positive colonies can then be picked, grown in culture,
and the recombinant clone isolated. Alternatively, probes designed in this manner can be used in
PCR to isolate a nucleic acid molecule from the pooled clones according to methods well known in
the art, e.g., by purifying the cDNA from the deposited culture pool, and using the probes in PCR
reactions to produce an amplified product having the corresponding desired polynucleotide
sequence.
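The Tm estimate stated above (2°C for each A or T and 4°C for each G or C) is simple enough to compute while designing a probe. The sketch below applies that rule; the function name is hypothetical, and the input used for illustration is the RACE primer of SEQ ID NO:324 rather than an actual colony hybridization probe.

```python
def estimated_tm(probe: str) -> int:
    """Estimate Tm in degrees C as 2 per A/T plus 4 per G/C."""
    probe = probe.upper()
    at_count = probe.count("A") + probe.count("T")
    gc_count = probe.count("G") + probe.count("C")
    if at_count + gc_count != len(probe):
        raise ValueError("probe may contain only A, C, G, and T")
    return 2 * at_count + 4 * gc_count


# 22-mer from SEQ ID NO:324: 10 A/T and 12 G/C residues
print(estimated_tm("CGCCGTGAATTACTGTGGATGG"))  # 68
```

By this rule a 22-mer of this composition falls short of the approximately 80°C target, so a probe for colony hybridization would need to be a few residues longer or richer in G and C.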

Those skilled in the art will recognize, or be able to ascertain, using not more than routine
experimentation, many equivalents to the specific embodiments of the invention described herein.
Such specific embodiments and equivalents are intended to be encompassed by the following
claims.

All publications and patent applications cited in this specification are herein incorporated
by reference as if each individual publication or patent application were specifically and
individually indicated to be incorporated by reference. The citation of any publication is for its
disclosure prior to the filing date and should not be construed as an admission that the present
invention is not entitled to antedate such publication by virtue of prior invention.

Although the foregoing invention has been described in some detail by way of illustration
and example for purposes of clarity of understanding, it is readily apparent to those of ordinary skill
in the art in light of the teachings of this invention that certain changes and modifications may be
made thereto without departing from the spirit or scope of the appended claims.

Table 1



SEQ 
ID 


CLUSTER 


SEQ NAME 


ORIENT 


CLONE ID 


LIBRARY 


1 


819545 


RTA22200265F.k.06. 1 .P.Seq 


F 


M00064554D:A03 


CH22PRC 


2 


377944 


RTA22200251F.j.02. 1 .P.Seq 


F 


M00063482A:A08 


CH21PRN 


3 


818497 


RTA22200252F.a. 13.1 .P.Seq 


F 


M00063514C:D03 


CH21PRN 


4 


819498 


RTA22200252F.n.05. 1 .P.Seq 


F 


M00063638C:G12 


CH21PRN 


5 


455465 


RTA22200264F.e. 16.1 .P.Seq 


F 


M00064454A:H10 


CH22PRC 


6 


819069 


RTA22200255F.f.01 . 1 .P.Seq 


F 


M00063940D:F09 


CH21PRN 


7 


672003 


RTA22200265F.b.O9. 1 .P.Seq 


F 


M00064517C:F11 


CH22PRC 


8 


728115 


RTA22200253F.O.24. 1 .P.Seq 


F 


M00063838B:GO8 


CH21PRN 


9 


372700 


RTA22200260F.b.20. 1 .P.Seq 


F 


M00063580C:A06 


CH22PRC 


10 


818056 


RTA22200266F.C. 13.1 .P.Seq 


F 


M00064593D:C01 


CH22PRC 


11 


818497 


RTA22200255F.a. 17. 1 .P.Seq 


F 


M00063920DH02 


CH21PRN 


. 12 


729832 


RTA22200267F.1.21 . 1 .P.Seq 


F 


M00064714A:G03 


CH22PRC 


13 


505514 


RTA22200251F.b.21 . 1 .P.Seq 


F 


M00063158A:A01 


CH21PRN 


14 


376488 


RTA22200254F.C.05. 1 .P.Seq 


F 


M00063852B:DO8 


CH21PRN 


15 


376488 


RTA22200260F.b.09. 1 .P.Seq 


F 


M00063578C:A06 


CH22PRC 


16 


748572 


RTA22200254F.C.07. 1 .P.Seq 


F 


M00063852D:F07 


CH21PRN 


17 


549934 


RTA22200253F.k. 18.1 .P.Seq 


F 


M00063801B:D04 


CH21PRN 


18 


819069 


RTA22200255F.e.24. 1 .P.Seq 


F 


M00063940D:F09 


CH21PRN 


19 


817618 


RTA22200253F.n. 16.1 .P.Seq 


F 


M00063828D:E05 


CH21PRN 


20 


124396 


RTA22200263F.a. 1 1 .2.P.Seq 


F 


M00064375B:G07 


CH22PRC 


21 


404375 


RTA22200260F.m.08 . 1 .P.Seq 


F 


M00063967D:G02 


CH22PRC 


22 


391820 


RTA22200261F.f.02. 1 .P.Seq 


F 


M00064000B:C03 


CH22PRC 


23 


672003 


RTA22200267F.i.06. 1 .P.Seq 


F 


M00064693D:F08 


CH22PRC 


24 


830620 


RTA22200263F.n.09. 1 .P.Seq 


F 


M00064424B:C12 


CH22PRC 


25 


450399 


RTA22200251F.f.23.1.P.Seq 


F 


M00063467D:H07 


CH21PRN 


26 


450982 


RTA22200261F.n. 18. 1 .P.Seq 


F 


M00064307B:G02 


CH22PRC 


27 


819894 


RTA22200264F.h. 18. 1 .P.Seq 


F 


M00064467B:D06 


CH22PRC 


28 


379302 


RTA22200257F.j.02.3.P.Seq 


F 


M00064178C:C04 


CH21PRN 


29 


379746 


RTA22200256F.e. 16. 1 .P.Seq 


F 


M00064086C:E01 


CH21PRN 


30 


124863 


RTA22200265F.m.06. 1 .P.Seq 


F 


M00064564A:CO2 


CH22PRC 


31 


379154 


RTA22200257F.C. 11.1 .P.Seq 


F 


M00064151B:C07 


CH21PRN 


32 


830620 


RTA22200262F.1.23.1. P.Seq 


F 


M00064358C:D09 


CH22PRC 


33 


389409 


RTA22200266F.1.24. 1 .P.Seq 


F 


M00064631A:C07 


CH22PRC 


34 


397284 


RTA22200262F.i.22. 1 .P.Seq 


F 


M00064346C:B09 


CH22PRC 


35 


819440 


RTA22200264F.e. 19.1 .P.Seq 


F 


M00064454C:B06 


CH22PRC 


36 


389409 


RTA22200266F.m.01 . 1 .P.Seq 


F 


M00064631A:C07 


CH22PRC 


37 


518848 ; 


RTA22200265F.n. 15.1 .P.Seq 


F 


M00064571C:C04 


CH22PRC 


38 


830620 


RTA22200263F.a.21.1. P.Seq 


F 


M00064376A:A05 


CH22PRC 


39 


379154 


RTA22200256F.f.20. 1 .P.Seq 


F 


M00064090D:D09 


CH21PRN 


40 


818544 


RTA22200256F.h.04. 1 .P.Seq 


F 


M00064105B:A03 


CH21PRN 


41 


817375 


RTA22200251F.a. 15 . 1 .P.Seq 


F 


M00063152C:B07 


CH21PRN 
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42 


455264 


RTA22200259F.e.23. 1 .P.Seq 


F 


M00063539C:C11 


CH22PRC 


43 


817503 


RTA22200266F.k. 11.1 .P.Seq 


F 


M00064624D:C09 


CH22PRC 


44 


377696 


RTA22200256F.d.21 . 1 .P.Seq 


F 


M00064082D:D10 


CH21PRN 


45 


375596 


RTA22200261F.h.10.1.P.Seq


F 


M00064009A:C01 


CH22PRC 


46 


817689 


RTA22200263F.h.05.1.P.Seq


F 


M00064399A:E01 


CH22PRC 


47 


831867 


RTA22200262F.i.15.2.P.Seq


F 


M00064345A:A03 


CH22PRC 


48 


830085 


RTA22200261F.k.14.1.P.Seq


F 


M00064293D:B12 


CH22PRC 


49 


389627 


RTA22200264F.c.10.1.P.Seq


F 


M00064447B:C06 


CH22PRC 


50 


397284 


RTA22200259F.k.09.1.P.Seq


F 


M00063555B:D01 


CH22PRC 


51 


380063 


RTA22200261F.j.02.1.P.Seq


F 


M00064014D:H05 


CH22PRC 


52 


830931 


RTA22200266F.m.23.1.P.Seq


F 


M00064633C:A03


CH22PRC 


53 


819321 


RTA22200257F.l.03.3.P.Seq


F 


M00064194C:D02 


CH21PRN 


54 


475587 


RTA22200261F.c.01.1.P.Seq


F 


M00063990A:D05 


CH22PRC 


55 


819046 


RTA22200255F.a.18.1.P.Seq


F 


M00063920D:H05 


CH21PRN 


56 


817477 


RTA22200253F.g.21.1.P.Seq


F 


M00063784A:H12 


CH21PRN 


57 


475587 


RTA22200261F.b.24.1.P.Seq


F 


M00063990A:D05 


CH22PRC 


58 


728115 


RTA22200253F.p.01.1.P.Seq


F 


M00063838B:G08 


CH21PRN 


59 


389627 


RTA22200260F.i.24.1.P.Seq


F 


M00063957A:E02 


CH22PRC 


60 


403453 


RTA22200256F.i.24.1.P.Seq


F 


M00064113B:C04 


CH21PRN 


61 


508525 


RTA22200255F.d.10.1.P.Seq


F 


M00063931B:F07 


CH21PRN 


62 


819525 


RTA22200261F.n.20.1.P.Seq


F 


M00064307C:G03 


CH22PRC 


63 


817618 


RTA22200255F.i.03.1.P.Seq


F 


M00064025D:H12 


CH21PRN 


64 


819403 


RTA22200254F.h.14.1.P.Seq


F 


M00063888D:D05 


CH21PRN 


65 


553242 


RTA22200254F.g.20.1.P.Seq


F 


M00063886A:B06 


CH21PRN 


66 


817417 


RTA22200255F.a.10.1.P.Seq


F 


M00063919C:E07 


CH21PRN 


67 


817618 


RTA22200252F.f.13.1.P.Seq


F 


M00063604A:B11 


CH21PRN 


68 


611440 


RTA22200262F.e.04.2.P.Seq 


F 


M00064328B:H09 


CH22PRC 


69 


817375 


RTA22200260F.m.06.1.P.Seq


F 


M00063967C:A12


CH22PRC 


70 


213577 


RTA22200255F.i.23.1.P.Seq


F 


M00064033C:C11 


CH21PRN 


71 


820061 


RTA22200265F.p.10.1.P.Seq


F 


M00064579D:E11 


CH22PRC 


72 


455264 


RTA22200259F.m.06.1.P.Seq


F 


M00063559D:G03 


CH22PRC 


73 


455264 


RTA22200255F.o.23.1.P.Seq


F 


M00064059A:C11 


CH21PRN 


74 


380331 


RTA22200255F.b.19.1.P.Seq


F 


M00063926A:H04 


CH21PRN 


75 


380331 


RTA22200252F.b.19.1.P.Seq


F 


M00063518D:A01


CH21PRN 


76 


817455 


RTA22200267F.o.01.1.P.Seq


F 


M00064723D:H03 


CH22PRC 


77 


423967 


RTA22200252F.a.20.1.P.Seq


F 


M00063515B:H02 


CH21PRN 


78 


220584 


RTA22200261F.m.14.1.P.Seq


F 


M00064302A:D10 


CH22PRC 


79 


817688 


RTA22200251F.e.20.1.P.Seq


F 


M00063462D:D07 


CH21PRN 


80 


549934 


RTA22200253F.n.10.1.P.Seq


F 


M00063826A:D03 


CH21PRN 


81 


819149 


RTA22200255F.e.16.1.P.Seq


F 


M00063938B:H07 


CH21PRN 


82 


817455 


RTA22200267F.n.24.1.P.Seq


F 


M00064723D:H03 


CH22PRC 
83


377696 


RTA22200251F.j.03.1.P.Seq


F 


M00063482A:F07 


CH21PRN 


84 


830146 


RTA22200260F.b.07.1.P.Seq


F 


M00063578B:E02 


CH22PRC 


85 


194490 


RTA22200264F.l.07.1.P.Seq


F 


M00064481C:F03 


CH22PRC 


86 


819460 


RTA22200257F.m.15.3.P.Seq


F 


M00064200D:E08 


CH21PRN 


87 


819018 


RTA22200257F.p.01.3.P.Seq


F 


M00064212D:E04 


CH21PRN 


88 


830620 


RTA22200259F.p.24.1.P.Seq


F 


M00063571B:G03 


CH22PRC 


89 


141079 


RTA22200262F.k.19.1.P.Seq


F 


M00064354A:A10 


CH22PRC 


90 


376588 


RTA22200256F.e.04.1.P.Seq


F 


M00064083D:E05 


CH21PRN 


91 


380604 


RTA22200264F.g.05.1.P.Seq


F 


M00064460C:B01 


CH22PRC 


92 


413138 


RTA22200260F.b.05.1.P.Seq


F 


M00063577C:C02 


CH22PRC 


93 


818544 


RTA22200265F.e.12.1.P.Seq


F 


M00064527A:H07 


CH22PRC 


94 


647435 


RTA22200257F.h.08.1.P.Seq


F 


M00064172C:A02 


CH21PRN 


95 


551785 


RTA22200266F.c.09.1.P.Seq


F 


M00064593A:A05 


CH22PRC 


96 


17092 


RTA22200261F.f.17.1.P.Seq


F 


M00064002C:F06 


CH22PRC 


97 


818326 


RTA22200251F.i.06.1.P.Seq 


F 


M00063478C:D01 


CH21PRN 


98 


377944 


RTA22200262F.e.03.2.P.Seq 


F 


M00064328B:H04 


CH22PRC 


99 


745559 


RTA22200262F.m.04.1.P.Seq


F 


M00064359B:H12 


CH22PRC 


100 


818326 


RTA22200265F.d.08.1.P.Seq


F 


M00064524A:A09 


CH22PRC 


101 


379879 


RTA22200264F.b.23.1.P.Seq


F 


M00064446A:D11 


CH22PRC 


102 


819640 


RTA22200257F.f.24.1.P.Seq


F 


M00064165A:B12


CH21PRN 


103 


818326 


RTA22200265F.a.14.1.P.Seq


F 


M00064514D:F11 


CH22PRC 


104 


243524 


RTA22200265F.g.04.1.P.Seq


F 


M00064532D:G06 


CH22PRC 


105 


43995 


RTA22200261F.l.02.1.P.Seq


F 


M00064294D:F01 


CH22PRC 


106 


597854 


RTA22200262F.g.06.2.P.Seq


F 


M00064337D:F01 


CH22PRC 


107 


268290 


RTA22200260F.p.14.1.P.Seq


F 


M00063981D:A06


CH22PRC 


108 


818043 


RTA22200256F.p.10.2.P.Seq


F 


M00064138A:F11 


CH21PRN 


109 


830930 


RTA22200267F.b.03.1.P.Seq


F 


M00064652B:D09 


CH22PRC 


110 


389627 


RTA22200260F.j.01.1.P.Seq


F 


M00063957A:E02 


CH22PRC 


111 


378730 


RTA22200260F.i.07.1.P.Seq


F 


M00063955C:F07 


CH22PRC 


112 


819037 


RTA22200260F.n.09.1.P.Seq


F 


M00063972C:E10 


CH22PRC 


113 


830397 


RTA22200261F.g.14.1.P.Seq


F 


M00064005D:A08


CH22PRC 


114 


450247 


RTA22200261F.e.10.1.P.Seq


F 


M00063998C:E09 


CH22PRC 


115 


819273 


RTA22200252F.b.09.1.P.Seq


F 


M00063517A:A04


CH21PRN 


116 


587779 


RTA22200257F.i.11.3.P.Seq


F 


M00064175B:B09 


CH21PRN 


117 


818639 


RTA22200256F.j.09.1.P.Seq


F 


M00064115B:E12 


CH21PRN 


118 


615617 


RTA22200261F.o.13.1.P.Seq


F 


M00064309C:H09 


CH22PRC 


119 


79309 


RTA22200257F.j.13.3.P.Seq


F 


M00064180A:G03 


CH21PRN 


120 


748994 


RTA22200261F.o.20.1.P.Seq 


F 


M00064310C:A10


CH22PRC 


121 


818682 


RTA22200258F.h.07.1.P.Seq


F 


M00064271B:D03 


CH21PRN 


122 


373061 


RTA22200253F.j.09.1.P.Seq


F 


M00063795C:D09 


CH21PRN 


123 


484413 


RTA22200253F.g.09.1.P.Seq


F 


M00063781B:B10 


CH21PRN 
124


819273 


RTA22200258F.h.04.1.P.Seq


F 


M00064270B:B03 


CH21PRN 


125 


569532 


RTA22200252F.h.18.1.P.Seq


F 


M00063613D:C11 


CH21PRN 


126 


170313 


RTA22200255F.g.20.1.P.Seq


F 


M00063949D:A05 


CH21PRN 


127 


818682 


RTA22200253F.p.14.1.P.Seq


F 


M00063841A:B09 


CH21PRN 


128 


377188 


RTA22200255F.l.06.1.P.Seq


F 


M00064043D:C09 


CH21PRN 


129 


518848 


RTA22200257F.j.22.3.P.Seq


F 


M00064186C:B03 


CH21PRN 


130 


45592 


RTA22200259F.l.08.1.P.Seq


F 


M00063557D:C07 


CH22PRC 


131 


819273 


RTA22200255F.n.19.1.P.Seq


F 


M00064053C:G04


CH21PRN 


132 


397284 


RTA22200251F.a.06.1.P.Seq


F 


M00063151D:B10 


CH21PRN 


133 


818326 


RTA22200258F.e.14.1.P.Seq


F 


M00064260C:E05 


CH21PRN 


134 


819037 


RTA22200251F.c.15.1.P.Seq


F 


M00063452A:F08 


CH21PRN 


135 


817417 


RTA22200253F.m.14.1.P.Seq


F 


M00063818C:A09 


CH21PRN 


136 


819640 


RTA22200254F.i.11.1.P.Seq


F 


M00063891A:F11 


CH21PRN 


137 


818771 


RTA22200254F.i.19.1.P.Seq


F 


M00063892B:G02


CH21PRN 


138 


389627 


RTA22200254F.k.10.1.P.Seq


F 


M00063898A:A10 


CH21PRN 


139 


379067 


RTA22200260F.e.20.1.P.Seq


F 


M00063593A:D03 


CH22PRC 


140 


818544 


RTA22200251F.f.02.1.P.Seq


F 


M00063463D:B05 


CH21PRN 


141 


819440 


RTA22200251F.j.22.1.P.Seq


F 


M00063485A:E05 


CH21PRN 


142 


817417 


RTA22200251F.k.10.1.P.Seq


F 


M00063487C:C02 


CH21PRN 


143 


385307 


RTA22200262F.k.11.1.P.Seq


F 


M00064352C:H01


CH22PRC 


144 


611440 


RTA22200263F.d.24.2.P.Seq 


F 


M00064386B:C02 


CH22PRC 


145 


376056 


RTA22200259F.e.16.1.P.Seq


F 


M00063538D:B01 


CH22PRC 


146 


611440 


RTA22200263F.d.24.1.P.Seq


F 


M00064386B:C02 


CH22PRC 


147 


820061 


RTA22200264F.f.09.1.P.Seq


F 


M00064457D:C09 


CH22PRC 


148 


617825 


RTA22200264F.p.06.1.P.Seq


F 


M00064508A:B09 


CH22PRC 


149 


819440 


RTA22200257F.h.17.1.P.Seq


F 


M00064173B:E01 


CH21PRN 


150 


819145 


RTA22200266F.m.08.1.P.Seq


F 


M00064631C:H11 


CH22PRC 


151 


817653 


RTA22200265F.p.07.1.P.Seq


F 


M00064579A:C06 


CH22PRC 


152 


611440 


RTA22200263F.e.01.1.P.Seq


F 


M00064386B:C02 


CH22PRC 


153 


375958 


RTA22200264F.j.22.1.P.Seq


F 


M00064476D:C04 


CH22PRC 


154 


611440 


RTA22200257F.a.20.1.P.Seq


F 


M00064144D:A07 


CH21PRN 


155 


831049 


RTA22200266F.o.13.1.P.Seq


F 


M00064637B:F03 


CH22PRC 


156 


818162 


RTA22200266F.g.18.1.P.Seq


F 


M00064610D:H01 


CH22PRC 


157 


553200 


RTA22200263F.p.02.1.P.Seq


F 


M00064429D:B07 


CH22PRC 


158 


139677 


RTA22200254F.o.07.1.P.Seq


F 


M00063910D:A12 


CH21PRN 




ijyo/ / 


RTA22200252F.c.11.1.P.Seq


F 


M00063520D:E11


CH21PRN 


160 


397284 


RTA22200262F.i.22.2.P.Seq 


F 


M00064346C:B09 


CH22PRC 


161 


385810 


RTA22200256F.m.04.2.P.Seq 


F 


M00064126C:F12 


CH21PRN 


162 


404624 


RTA22200261F.e.07.1.P.Seq


F 


M00063997C:B12 


CH22PRC 


163 


375958 


RTA22200262F.b.14.2.P.Seq


F 


M00064322C:A10 


CH22PRC 


164 


616555 


RTA22200265F.b.24.1.P.Seq


F 


M00064520A:E04 


CH22PRC 
165


616555 


RTA22200265F.c.01.1.P.Seq


F 


M00064520A:E04 


CH22PRC 


166 


295694 


RTA22200260F.o.20.1.P.Seq


F 


M00063978B:B06 


CH22PRC 


167 


36113 


RTA22200265F.e.06.1.P.Seq


F 


M00064526D:F05 


CH22PRC 


168 


831812 


RTA22200263F.f.05.1.P.Seq


F 


M00064390A:C05 


CH22PRC 


169 


817653 


RTA22200252F.g.23.1.P.Seq


F 


M00063610D:C11 


CH21PRN 


170 


397284 


RTA22200252F.m.15.1.P.Seq


F 


M00063636A:E01 


CH21PRN 


171 


817979 


RTA22200253F.p.15.1.P.Seq


F 


M00063841A:E08 


CH21PRN 


172 


817653 


RTA22200255F.m.18.1.P.Seq


F 


M00064048C:G12 


CH21PRN 


173 


611440 


RTA22200253F.f.03.1.P.Seq


F 


M00063774A:D09


CH21PRN 


174 


386014 


RTA22200261F.f.06.1.P.Seq


F 


M00064001A:B03 


CH22PRC 


175 


549981 


RTA22200255F.b.10.1.P.Seq


F 


M00063925B:F04 


CH21PRN 


176 


193373 


RTA22200255F.l.21.1.P.Seq


F 


M00064046A:G02 


CH21PRN 


177 


400619 


RTA22200255F.g.14.1.P.Seq


F 


M00063947D:D01 


CH21PRN 


178 


831149 


RTA22200261F.o.21.1.P.Seq


F 


M00064310D:F03 


CH22PRC 


179 


36113 


RTA22200255F.d.16.1.P.Seq


F 


M00063932D:G08 


CH21PRN 


180 


817503 


RTA22200253F.l.16.1.P.Seq


F 


M00063805D:E05 


CH21PRN 


181 


376588 


RTA22200260F.i.11.1.P.Seq


F 


M00063955D:F05 


CH22PRC 


182 


141079 


RTA22200252F.f.23.1.P.Seq


F 


M00063606C:B04 


CH21PRN 


183 


818063 


RTA22200253F.p.04.1.P.Seq


F 


M00063839A:F01 


CH21PRN 


184 


455264 


RTA22200253F.n.14.1.P.Seq


F 


M00063828A:H12


CH21PRN 


185 


189234 


RTA22200251F.f.17.1.P.Seq


F 


M00063466C:C11 


CH21PRN 


186 


295694 


RTA22200265F.i.05.1.P.Seq


F 


M00064550A:A07 


CH22PRC 


187 


648679 


RTA22200260F.f.06.1.P.Seq


F 


M00063594B:H07 


CH22PRC 


188 


830930 


RTA22200264F.e.10.1.P.Seq


F 


M00064452D:E11 


CH22PRC 


189 


818497 


RTA22200256F.d.07.1.P.Seq


F 


M00064079C:A10 


CH21PRN 


190 


373928 


RTA22200256F.d.19.1.P.Seq


F 


M00064082A:A08 


CH21PRN 


191 


385307 


RTA22200263F.j.12.1.P.Seq


F 


M00064406B:H06 


CH22PRC 


192 


403453 


RTA22200266F.e.10.1.P.Seq


F 


M00064601D:B05 


CH22PRC 


193 


730318 


RTA22200264F.c.09.1.P.Seq


F 


M00064447B:A07 


CH22PRC 


194 


44183 


RTA22200271F.a.01.1.P.Seq


F 


M00021929A:D03


CH03MAH 


195


373928 


RTA22200255F.d.22.1.P.Seq


F 


M00063934B:E04 


CH21PRN 


196 


404624 


RTA22200255F.d.23.1.P.Seq


F 


M00063934C:C10 


CH21PRN 


197 


403173 


RTA22200253F.a.21.1.P.Seq


F 


M00063685A:C02 


CH21PRN 


198 


372700 


RTA22200253F.c.06.1.P.Seq


F 


M00063689D:E12 


CH21PRN 


199 


374343 


RTA22200261F.h.04.1.P.Seq


F 


M00064008A:B01 


CH22PRC 


200 


597854 


RTA22200255F.j.03.1.P.Seq


F 


M00064033D:B01 


CH21PRN 


201 


817417 


RTA22200255F.a.23.1.P.Seq


F 


M00063922B:A12 


CH21PRN 


202 


818497 


RTA22200257F.k.05.3.P.Seq 


F 


M00064188B:G08 


CH21PRN 


203 


377696 


RTA22200255F.f.15.1.P.Seq


F 


M00063943B:G12 


CH21PRN 


204 


379105 


RTA22200252F.n.19.1.P.Seq


F 


M00063642B:A08 


CH21PRN 


205 


831188 


RTA22200267F.o.02.1.P.Seq


F 


M00064723D:H11 


CH22PRC 
ID


CLUSTER 


SEQ NAME 


ORIENT 


CLONE ID 


LIBRARY 



206 


376056 


RTA22200253F.m.09.1.P.Seq


F 


M00063810C:E03


CH21PRN 


207 


124863 


RTA22200255F.n.15.1.P.Seq


F 


M00064053B:D09 


CH21PRN 



208 


376056 


RTA22200254F.i.03.1.P.Seq


F 


M00063890A:F11 


CH21PRN 



209 


831812 


RTA22200266F.j.10.1.P.Seq


F 


M00064620C:D01


CH22PRC 



210 


141079 


RTA22200260F.i.14.1.P.Seq


F 


M00063956A:F05 


CH22PRC 



211 


19148 


RTA22200265F.o.18.1.P.Seq


F 


M00064577C:B12


CH22PRC 



212 


124396 


RTA22200252F.a.14.1.P.Seq


F 


M00063514C:E08


CH21PRN 



213 


831026 


RTA22200265F.c.03.1.P.Seq


F 


M00064520A:F08 


CH22PRC 



214 


819037 


RTA22200263F.i.23.1.P.Seq


F 


M00064405B:C04


CH22PRC 



215 


380207 


RTA22200263F.i.19.1.P.Seq


F 


M00064404C:G05


CH22PRC 


216 


819460 


RTA22200255F.c.13.1.P.Seq


F 


M00063928A:G09


CH21PRN 


217 


379067 


RTA22200253F.e.23.1.P.Seq


F 


M00063784C:E10


CH21PRN


218 


403173 


RTA22200252F.o.23.1.P.Seq


F 


M00063682A:C04


CH21PRN 



219 


3856 


RTA22200269F.a.05.1.P.Seq


F 


M00003773D:H02


V^X 1U X V>V/11 


220 


378551 


RTA22200263F.d.17.1.P.Seq


F 


M00064385D:C11


CH22PRC 


221 


456089 


RTA22200272F.a.09.1.P.Seq


F 


M00043134A:A05


CH19COP 


222 


549981 


RTA22200267F 2lTL\V Sea 


F 


M00064650B:B07


CH22PRC


223 


378551 


RTA22200265F.m.21.1.P.Seq


F 


M00064568A:H06


CH22PRC 


224 


819201 


RTA22200256F.n.23.2.P.Seq


F 


M00064132B:B07


CH21PRN 


225 


374826 


RTA22200251F.c.20.1.P.Seq


F 


M00063453B:F08


CH21PRN 



226 


389409 


RTA22200253F.l.23.1.P.Seq


F 


M00063807A:D12


CH21PRN 



227 


819149 


RTA22200260F.a.17.1.P.Seq


F 


M00063575B:G02


CH22PRC 


228 


389409 


RTA22200255F.e.18.1.P.Seq


F 


M00063939C:D06 


CH21PRN


229 


818165 


RTA22200254F.h.15.1.P.Seq


F 


M00063888D:F02


CH21PRN 



230 


817757 


RTA22200252F.i.15.1.P.Seq


F 


M00063617D:F09


CH21PRN 



231 


553242 


RTA22200263F.i.20.1.P.Seq


F 


M00064404D:A06


CH22PRC 



232 


385615 


RTA22200265F.b.08.1.P.Seq


F 


M00064517B:F10 


CH22PRC 



233 


819102 


RTA22200258F.h.19.1.P.Seq


F 


M00064272C:G01


CH21PRN 



234 


817757 


RTA22200255F.o.16.1.P.Seq


F 


M00064057C:H10 


CH21PRN 


235 


385615 


RTA22200265F.b.07.1.P.Seq


F 


M00064517B:F04 


CH22PRC 


236 


385615 


RTA22200253F.l.06.1.P.Seq


F 


M00063804C:A11 


CH21PRN 


237 


827355 


RTA22200266F.n.23.1.P.Seq


F 


M00064636B:A04 


CH22PRC 


238 


817629 


RTA22200259F.a.13.1.P.Seq


F 


M00063165A:C09 


CH22PRC 


239 


817514 


RTA22200260F.h.02.1.P.Seq


F 


M00063600C:C09 


CH22PRC 



240 


817514 


RTA22200252F.p.21.1.P.Seq


F 


M00063681B:C02 


CH21PRN 


241 


680563 


RTA22200265F.f.13.1.P.Seq


F 


M00064530B:H02 


CH22PRC 


242 


827355 


RTA22200255F.e.20.1.P.Seq


F 


M00063939C:H01 


CH21PRN 


243 


377286 


RTA22200254F.a.04.1.P.Seq


F 


M00063843B:D07 


CH21PRN 


244 


680563 


RTA22200258F.g.18.1.P.Seq


F 


M00064268D:G03 


CH21PRN 


245 


819156 


RTA22200255F.h.06.1.P.Seq


F 


M00064021D:H01


CH21PRN 


246 


220584 


RTA22200261F.f.22.1.P.Seq


F 


M00064003B:C10 


CH22PRC 


247 


616555 


RTA22200263F.o.12.1.P.Seq


F 


M00064428B:A12 


CH22PRC 


248 


819498 


RTA22200254F.o.14.1.P.Seq


F 


M00063912A:D06 


CH21PRN 


249 


817508 


RTA22200257F.h.01.1.P.Seq


F 


M00064171D:E05 


CH21PRN 


250 


817690 


RTA22200257F.e.05.1.P.Seq


F 


M00064159A:H03


CH21PRN 


251 


819156 


RTA22200256F.h.13.1.P.Seq


F 


M00064106C:G03


CH21PRN 


252 


830904 


RTA22200266F.j.12.1.P.Seq


F 


M00064620D:G05


CH22PRC 


253 


819498


RTA22200253F.b.04.1.P.Seq


F 


M00063686B:E07 


CH21PRN 


254 


817508 


RTA22200257F.g.24.1.P.Seq


F 


M00064171D:E05 


CH21PRN 


255 


817508 


RTA22200252F.a.19.1.P.Seq


F 


M00063515B:F06 


CH21PRN 


256 


831160 


RTA22200267F.h.01.1.P.Seq


F 


M00064690A:C04


CH22PRC 


257 


817762 


RTA22200252F.k.13.1.P.Seq


F 


M00063627C:F06


CH21PRN 


258 


377286 


RTA22200266F.k.07.1.P.Seq


F 


M00064624C:B03 


CH22PRC 


259 


831160 


RTA22200267F.g.24.1.P.Seq


F 


M00064690A:C04 


CH22PRC 


260 


819994 


RTA22200256F.k.11.1.P.Seq


F 


M00064119C:D12 


CH21PRN 


261 


819994 


RTA22200256F.k.09.1.P.Seq


F 


M00064119B:H10 


CH21PRN 


262 


373298 


RTA22200259F.c.19.1.P.Seq


F 


M00063533A:C12 


CH22PRC 


263 


819894 


RTA22200256F.m.03.2.P.Seq 


F 


M00064126C:C02 


CH21PRN 


264 


372718 


RTA22200260F.b.22.1.P.Seq


F 


M00063580D:B06 


CH22PRC 


265 


827355 


RTA22200262F.l.20.1.P.Seq


F 


M00064358A:G03


CH22PRC 


266 


819894 


RTA22200255F.d.09.1.P.Seq


F 


M00063931B:E10 


CH21PRN 


267 


827355 


RTA22200266F.e.07.1.P.Seq


F 


M00064601C:G07 


CH22PRC 


268 


372718 


RTA22200256F.l.03.1.P.Seq


F 


M00064122C:B06 


CH21PRN 


269 


647435 


RTA22200251F.b.10.1.P.Seq


F 


M00063156D:H10 


CH21PRN 


270 


450262 


RTA22200265F.a.10.1.P.Seq


F 


M00064514A:G10


CH22PRC 


271 


484703 


RTA22200255F.i.20.1.P.Seq


F 


M00064032D:G04 


CH21PRN 


272 


819498 


RTA22200256F.f.12.1.P.Seq


F 


M00064089B:F09


CH21PRN 


273 


406043 


RTA22200263F.i.12.1.P.Seq


F 


M00064404A:B05


CH22PRC 


274 


817500 


RTA22200255F.f.24.1.P.Seq


F 


M00063945A:C03 


CH21PRN 


275 


818180 


RTA22200264F.o.18.1.P.Seq


F 


M00064506A:C07 


CH22PRC 


276 


818143 


RTA22200251F.a.03.1.P.Seq


F 


M00063151A:G06 


CH21PRN 


277 


819756 


RTA22200267F.a.18.1.P.Seq


F 


M00064649A:E04


CH22PRC 


278


406908 


RTA22200257F.i.18.3.P.Seq


F 


M00064176D:H10


CH21PRN 


279 


124863 


RTA22200256F.o.21.2.P.Seq


F 


M00064136C:D12


CH21PRN 


280 


429009 


RTA22200257F.e.24.1.P.Seq


F 


M00064161B:G04


CH21PRN 


281 


402586 


RTA22200257F.i.24.3.P.Seq


F 


M00064178B:A05


CH21PRN 




4UU4 /J 


RTA22200254F.i.04.1.P.Seq


F


M00063890A:H04


CH21PRN 


283 


403453 


RTA22200264F.d.12.1.P.Seq


F 


M00064450C:E07 


CH22PRC


284 


383021 


RTA22200259F.d.06.1.P.Seq


F 


M00063534C:A02 


CH22PRC 


285 


394913 


RTA22200254F.p.10.1.P.Seq


F 


M00063915C:E01 


CH21PRN 


286 


831361 


RTA22200263F.k.19.1.P.Seq


F 


M00064414D:D06 


CH22PRC 


287 


646020 


RTA22200267F.n.21.1.P.Seq


F 


M00064723C:H04 


CH22PRC 
ID


CLUSTER


SEQ NAME


ORIENT


CLONE ID


LIBRARY


288


R^1 %f%\ 


PTA999nfY9£'3F 1 O,^ 1 P C^n 

x\.i/\.zzzuuzooj7.i.uj. i ..r .oeq 


F


IVIUUUOH-H- 1 3r> . \J\Jj 


CH22PRC


289


OJlJOU 


T?TA999nn9A1 Th f 1Q 1 p 

xvi /vzzzuuzoxr 1 .1. io.i .Jr.ocq 


F


JVlUUUO^f UUZi^ . Jtiuy 


CH22PRC




4n9^R^» 

^fUZJOO 


PTA999nn9^7P i n.1 ^ P Q^n 
xv i i\zzzuuz.j / Jr .J . u I . d . r\ oeq 


F


JVIUUU041 /oxd.AUD 


CH21PRN


291




PTA999nn9A9"R J 91 1 P 

xv i azzzuuzozjt .j.zi . I .i^.oeq 


F


MUUU04 JJ . JrlU 1 


CH22PRC


292


OXOZO / 


PTA 999009/^9 Th J-i 1 zl 9 P C<=^ 

xvi/\zzzuuzozj H .n. l'H-.z.Jr.oeq 


F


JV1UUU04341A.CUZ 


CH22PRC


293




PT 1 A999HH9A1 In i 9H 1 P Con 

Kl AZZZUUZ01.T.J.ZU. l.jr.oeq 


F


MUUUD4U 1 oL . -cAJ / 


CH22PRC






RTA22200265F.m.24.1.P.Seq


F


M00064569B:A09


CH22PRC


295


1 qa /ion 


RTA22200252F.c.10.1.P.Seq


F


MUUUo35zUU : DUo 


CH21PRN


296


Q 1 Q 1 /1Q 

olol4^ 


K 1 AZZZUUZD4Jr . D . 1 o . 1 . P. oeq 


F


MUU06384oC:vj1 1 


CH21PRN


297


5 / /Zoo 


RTA22200259F.a.10.1.P.Seq


F


M00063163A:G04


CH22PRC 


298


ool^Ol 


RTA22200265F.n.01.1.P.Seq


F


M00064569B:A09


CH22PRC 


299


oojjU/ 


RTA22200255F.p.07.1.P.Seq


F


M00064060B:D03


CH21PRN 




D /o44/ 


RTA22200251F.c.01.1.P.Seq


F


JVlUU0o3 1 5oA : li 1 1 


CH21PRN 


301


o /o44/ 


RTA22200251F.b.24.1.P.Seq


F


MUU063 1 5 o A : Jb 1 1 


CH21PRN 


302


Q 1 TCI 4 

ol /D14 


Kl Azzzuuzouj^.m. 1 / . i.r.beq 


F


A JAAA^'lA/COn .nno 

JVlUUUoJ 9ooD : CjUo 


CH22PRC






K 1 AZZZUUZDD Jr . I. U3 . 1 . sr . oeq 




MUUUo394 1 B : C 1 Z 


CH21PRN


304




PTA99900967F e 9^ 1 P <sph 




iVXUUUOHhO / OXv .rUJ 


CH22PRC


305 


817363 


RTA22200266F.f.04. 1 .P.Seq 


F 


M00064605C:G05 


CH22PRC 


306 


818942 


RTA22200255F.i.02. 1 .P.Seq 


F 


M00064025D:E07 


CH21PRN 


307 


818942 


RTA22200265F.g.23 . 1 .P.Seq 


F 


M00064534D-.F06 


CH22PRC 


308 


817457 


RTA22200267F.e. 15. 1 .P.Seq 


F 


M00064675C:E09 


CH22PRC 


309 


831968 


RTA22200263F.f.23. 1 .P.Seq 


F 


M00064393B:H04 


CH22PRC 


310 


530941 


RTA22200253F.h.05. 1 .P.Seq 


F 


M00063785C:F03 


CH21PRN 


311 


763446 


RTA22200257F.j .05 .3 .P.Seq 


F 


M00064179A:C04 


CH21PRN 


312 


763446 


RTA22200255F.n.21 . 1 .P.Seq 


F 


M00064053D:F02 


CH21PRN 


313


819219 


RTA22200256F.f. 16. 1 .P.Seq 


F 


M00064090C:A02 


CH21PRN 


314 


763446 


RTA22200258F.b. 19.2.P.Seq 


F 


M00064248A:E02 


CH21PRN 


315 


10154 










316 


10154 
Nearest
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






ID


ACCESSION


DESCRIPTION


P VALUE


ACCESSION


DESCRIPTION


P VALUE


19 


<NONE> 


<NONE> 


<NONE>


1077580 


hypothetical 
protein 
YDR125c -
yeast 


7 


20 


<NONE> 


<NONE> 


<NONE>


4585925 


(AC007211) 
unknown protein 


6 


21 


<NONE> 


<NONE> 


<NONE>


1085306 


EVI1 protein - 
human 


4.3 


22 


<NONE> 


<NONE> 


<NONE>


3876587 


(Z81521) 
predicted using 
Genefinder; 
cDNA EST 
yk233g4.5 
comes from this 
gene; cDNA 
EST yk233g4.3 
comes from this 
gene 

[Caenorhabditis 
elegans] 


0.85 


23 


<NONE>




<NONE>




similar to S.
cerevisiae nuclear
protein SNF2




24 


<NONE> 


<NONE> 


<NONE>


157272 


(L11345) DNA-
binding protein

[Drosophila 
melanogaster] 


0.29 


25 


<NONE> 


<NONE> 


<NONE>


2633160 


(Z99108) 
similar to 

surface adhesion 
YfiQ [Bacillus 
subtilis] 


0.19 


26 


<NONE> 


<NONE> 


<NONE>


755468 


(U19879)

transmembrane 

protein 

[Xenopus laevis] 


0.042 


27 


<NONE> 


<NONE> 


<NONE>


4507339 


T brachyury 
(mouse) 

homolog protein 
[Homo sapiens] 


0.029 


28 


<NONE> 


<NONE> 


<NONE>


729711 


PROTEASE 
DEGS 

PRECURSOR 
3.4.21.-) hhoB- 
Escherichia coli 
>gi |558913 
(U15661) HhoB 
[Escherichia 
coli] 

>gi |606174 
(U18997) 
ORF o355 coli] 
>gi |1789630 
(AE000402) 
protease 
[Escherichia 
coli] 


0.004 


29 


<NONE> 


<NONE> 


<NONE>


3168911 


(AF068718) No 
definition line 
found 

[Caenorhabditis 
elegans] 


8e-013 


30 


<NONE> 


<NONE> 


<NONE>


2832777 


(AL021086) 
/prediction = (me 
thod:; comes 
from the 5'
UTR 

[Drosophila 
melanogaster] 


3e-040 


31 


X78712 


H. sapiens 
mRNA for 
glycerol kinase 
testis specific 2 


2.1 


2852449 


(D88207) 
protein kinase 
[Arabidopsis 
thaliana] 
>gi |2947061 
(AC002521)
putative protein 
kinase 


9.1 


32 


X60760 


L.esculentum 
TDR8 mRNA 


2.1 


157272 


(L11345) DNA-
binding protein 
[Drosophila 
melanogaster] 


5 


33 


U40853 


Oryctolagus 

cuniculus 

pulmonary 

surfactant 

protein B (SP-B) 

gene, complete 

cds 


2 


<NONE> 


<NONE> 


<NONE>


34 


AF083655 


Homo sapiens 
procollagen C- 
proteinase 
enhancer protein 
(PCOLCE) 
gene, 5' 
flanking region 
and complete 
cds 


2 


<NONE> 


<NONE> 


<NONE>


35 


AJ223776 


Staphylococcus
warneri hld gene


2 


<NONE> 


<NONE> 


<NONE>


36 


U40853 


Oryctolagus 

cuniculus 

pulmonary 

surfactant 

protein B (SP-B) 

gene, complete 

cds 


2 


<NONE> 


<NONE> 


<NONE>


37 


X04436 


Clostridium 
tetani gene for 
tetanus toxin 


2 


<NONE> 


<NONE> 


<NONE>


38 


Z35787 


S.cerevisiae 
chromosome II 
reading frame 
ORF YBL026w 


2 


157272 


(L11345) DNA-
binding protein 
[Drosophila 
melanogaster] 


8.4 


39 


X78712 


H. sapiens 
mRNA for 
glycerol kinase 
testis specific 2 


2 


2852449 


(D88207) 
protein kinase 
[Arabidopsis 
thaliana] 
>gi |2947061 
(AC002521) 
putative protein 
kinase 


8.2 


40 


Z15056 


B.subtilis genes 
spoVD, murE, 
mraY, murD 


2 


477124 


P3A2 DNA 
binding protein 
homolog EWG - 
fruit fly 
(Drosophila 
melanogaster) 


2.8 


41 


S65623 


cAMP-regulated 
enhancer- 
binding protein 
1 of 3] 


2 


119266 


PROTEIN 
GRAINY- 
HEAD (DNA- 
BINDING 
PROTEIN ELF- 
1) (ELEMENT 
I-BINDING 
ACTIVITY) 
regulatory 
protein elf-1 - 
fruit fly 
(Drosophila 
melanogaster) 
>gi|7939|emb 
|CAA33692| 
(X15657) Elf-1 
protein (AA 1-
1063)

[Drosophila 
melanogaster] 


0.55 


42 


NM_004415.1


Homo sapiens 
desmoplakin 
(DPI, DPII) 
(DSP) mRNA 
mRNA, 
complete cds 


2 


2649177 


(AE001008) 
conserved 
hypothetical 
protein 

[Archaeoglobus 
fulgidus] 


0.2 


43 


AF031552 


Vibrio cholerae 
magnesium 
transporter 
(mgtE) gene, 
partial cds; 
sensor kinase 
(vieS), response 
regulator 
(vieA), and 
response 
regulator (vieB) 
genes, complete 
cds; and 
collagenase 
(vcc) gene,


2 


2088714 


(AF003139) 
strong similarity 
to NADPH 
oxidases; partial 
CDS, the gene 
begins in the 
neighboring 
clone 


2e-013 


(vcc) gene, 
partial cds 










44 


AF116852.1 


Danio rerio 
dickkopf-1 
(dkk1) mRNA,
complete cds 


2 


3800951 


(AF100657) No 
definition line 
found 

[Caenorhabditis 
elegans] 


2e-019 


45 


X82595 


P. sativum fuc 
gene 


1.9 


<NONE> 


<NONE> 


<NONE>


46 


AF008216 


Homo sapiens 
candidate tumor
suppressor
pp32r1


1.9 


<NONE> 


<NONE> 


<NONE>


47 


AF130672.1


Felis catus clone 
Fca603 
microsatellite
sequence 


1.9 


<NONE> 


<NONE> 


<NONE>


48 


AJ007044 


Oryctolagus 
cuniculus sod
gene 


1.9 


388055 


(L22981) 
merozoite 
surface protein- 
1 [Plasmodium 
chabaudi] 


7.8 


49 


AC004497 


Homo sapiens 
chromosome 21, 
P1 clone
LBNL#6 


1.9 


160925 


(M94346) 
A. 1.12/9 
antigen 
[Schistosoma 
mansoni] 


7.7 


50 


U30290 


Rattus 
norvegicus 
galanin receptor 
GALR1 mRNA, 
complete cds 


1.9 


3024079 


GALECTIN-4 
(LACTOSE- 
BINDING 
LECTIN 4) (L- 
36 LACTOSE 
BINDING 
PROTEIN) 
(L36LBP) 
>gi |2281707 
sapiens] 
>gi |2623387 
(U82953) 
galectin-4 
[Homo sapiens] 


4.5 


51 


Y13234 


Chironomus 
tentans mRNA 
for chitinase, 
1695 bp 


1.9 


4567068 


(AF125568)
tumor 

suppressing STF 
cDNA 4 [Homo 
sapiens] 


3.4 


52 


NM_003644.1


Homo sapiens 
growth arrest- 
specific 7 
(GAS7) mRNA 
> :: 

emb|AJ224876| 
HSAJ4876 
Homo sapiens
mRNA for 
GAS7 protein 


1.9 


125560 


PROTEIN 
KINASE C, 
GAMMA TYPE 
C (EC 2.7.1.-) 
gamma - rabbit 
>gi |165652 
(M19338) 
protein kinase 
delta 

[Oryctolagus 
cuniculus] 


0.53 


53 


AB013448.1 


Oryza sativa 
gene for Pib, 
complete cds 


1.8 


<NONE> 


<NONE> 


<NONE>


54 


D63854 


Human 

cytomegalovirus

DNA, replication 
origin 


1.8 


<NONE> 


<NONE> 


<NONE>


55 


AB002340 


Human mRNA 
for KIAA0342 
gene, complete 
cds 


1.8 


<NONE> 


<NONE> 


<NONE>


56 


AF017779 


Mus musculus 
vitamin D 
receptor gene, 
promoter region 


1.8 


<NONE> 


<NONE> 


<NONE>


57 


D63854 


Human 

cytomegalovirus 

DNA, replication

origin 


1.8 


<NONE> 


<NONE> 


<NONE>


58 


M24102 


Bovine 
ADP/ATP 
translocase T1
mRNA, 
complete cds. 


1.8


<NONE>


<NONE>


<NONE
> 


59 


AC004497 


Homo sapiens 
chromosome 21, 
P1 clone
LBNL#6 


1.8


<NONE>


<NONE> 


<NONE 
> 


60 


M37394 


Rat epidermal 
growth factor 
receptor mRNA. 


1.8 


<NONE> 


<NONE> 


<NONE 

> 


61 


AF006304 


Saccharomyces 
cerevisiae 
protein tyrosine 
phosphatase 
(PTP3) gene, 
complete cds 


1.8 


<NONE> 


<NONE> 


<NONE 

> 


62 


D13454


Candida 
albicans 
CACHS3 gene 
for chitin 
synthase III 


1.8 


<NONE> 


<NONE> 


<NONE 

> 


63 


Y00354 


Xenopus laevis 
gene encoding 
vitellogenin A2 


1.8 


1077580 


hypothetical 
protein 
YDR125c - 
yeast 


7.5 


64 


U90936 


Aspergillus 
niger px27 
gene, promoter 
region 


1.8 


4337033 


(AF124138) 
transcriptional 
activator protein 
CdaR 

[Streptomyces 

coelicolor] 

transcriptional 

regulator 

[Streptomyces 

coelicolor] 


7.3 
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(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






65 


D84448 


Cavia cobaya 
mRNA for 
Na+,K+-
ATPase beta-3 
subunit, 
complete cds 


1.8 


4704603 


(AF109916) 

putative 

dehydrin 


7.1 


66 


AF039948 


Xenopus laevis 
clone H-0 
transcription 
elongation factor 
S-II (TFIIS)
precursor RNA, 
isoform 

TFIIS.h, partial 
cds 


1.8 


1695839 


(U58151) 

envelope 

glycoprotein 

[Human 

immunodeficien 

cy virus type 1] 


5.6 


67 


M18061 


Xenopus laevis 
vitellogenin
gene, complete 
cds. 


1.8 


780502 


(U18466) AP 
endonuclease 
class II [African 
swine fever 
virus]

>gi| 1097525 |p
rf| |2113434ET 
AP 

endonuclease: IS 
OTYPE= class 
II [African 
swine fever 
virus] 


3.1 


68 


U61112 


Mus musculus 
Eya3 homolog 
mRNA, 
complete cds 


1.8 


3043646 


(AB011133) 
KIAA0561 
protein [Homo 
sapiens] 


1.9 


69 


AB018442 


Oryza sativa 
mRNA for 
phytochrome C, 
complete cds 


1.8 


4455041 


(AF116463)
unknown 
[Streptomyces 
lincolnensis] 


0.49 
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(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






70 


D63854 


Human 

cytomegalovirus 
DNA, replication 
origin 


1.8 


1169200 


DNA- 

DAMAGE- 

REPAIR/TOLE 

RATION 

PROTEIN 

DRT111 

PRECURSOR

>gi| 421829| pi

r| |S33706 

DNA-damage 

resistance 

protein - 

Arabidopsis 

thaliana and 

DNA-damage 

resistance 

protein 

(DRT111) 

mRNA, 

complete cds.], 

gene product 

[Arabidopsis 

thaliana] 


0.22 


71 


D26549 


Bovine mRNA 
for adseverin, 
complete cds 


1.8 


755468 


(U19879) 

transmembrane 

protein 

[Xenopus laevis] 


0.042 


72 


J05211 


Human 
desmoplakin 
mRNA, 3' end. 


1.8 


728867 


ANTHER-
SPECIFIC 
PROLINE- 
RICH 

PROTEIN APG 
PRECURSOR 
>gi| 99694 |pir 
||S21961 
proline-rich 
protein APG - 
Arabidopsis 
thaliana
>gi| 22599| em
b|CAA42925| 


0.015 
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Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
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Proteins) 






73 


NM_004415.1


Homo sapiens 
desmoplakin 
(DPI, DPII) 
(DSP) mRNA 
mRNA, 
complete cds 


1.8 


728867 


ANTHER-
SPECIFIC 
PROLINE- 
RICH 

PROTEIN APG 
PRECURSOR 
>gi| 99694 |pir 
||S21961 
proline-rich 
protein APG - 


Arabidopsis 

thaliana

>gi| 22599| em

b|CAA42925| 


0.015 


74 


AF038604 


Caenorhabditis 
elegans cosmid 
B0546 


1.8 


3877951 


(Z81555) 
predicted using 
Genefinder 


3e-008 


75 


AF038604 


Caenorhabditis 
elegans cosmid 
B0546 


1.8 


3877951 


(Z81555) 
predicted using 
Genefinder 


2e-011 


76 


U23551 


Prochlorothrix 
hollandica 
phosphomannom 
utase 


1.8 


2828280 


(AL021687) 

putative protein 

[Arabidopsis 

thaliana] 

>gi| 2832633 |e 

mb|CAA16762 

| (AL021711) 

putative protein 

[Arabidopsis 

thaliana] 


2e-013 


77 


S60150 


ORF1...ORF6
{3' terminal
region}

[chrysanthemum 
virus B CVB, 
Genomic RNA, 
6 genes, 3426 
nt] 


1.8 


1065454 


(U40410) 
C54G7.2 gene 
product 

[Caenorhabditis 
elegans] 


2e-019


78 


AB014558 


Homo sapiens 
mRNA for 
KIAA0658 
protein, partial 
cds 


1.8 


3850072 


(AL033385) 
dna-directed rna 
polymerase iii 
subunit 

[Schizosaccharo 
myces pombe] 


6e-027 
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(BlastX vs. Non- 
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Proteins) 






79 


X17191 


E. gracilis 

chloroplast 

RNA 

polymerase 
rpoB-rpoC1-
rpoC2 operon 


1.7 


<NONE> 


<NONE> 


<NONE 
> 


80 


X07729 


R.norvegicus 
gene encoding 
neuron-specific 
enolase, exons 
8-12 


1.7 


4584544 


(AL049608) 
extensin-like 
protein 


8.8 


81 


D38178 


Human gene for 
cytosolic 
phospholipase 
A2, exon 1 


1.7 


73714 


infected cell 
protein ICP34.5 
- human 
herpesvirus 1 
(strain F) 
>gi |330123 
(M12240) 
infected cell 
protein [Herpes 
simplex virus 
type 1] 


1.1 


82 


U23551 


Prochlorothrix 
hollandica 
phosphomannom 
utase 


1.7 


2828280 


(AL021687) 

putative protein 

[Arabidopsis 

thaliana]

>gi| 2832633 |e

mb|CAA16762 

| (AL021711) 

putative protein 

[Arabidopsis 

thaliana] 


2e-010 


83 


Y00525 


Klebsiella 
pneumoniae 
nifL gene for 
regulatory 
protein 


1.6 


3800951 


(AF100657) No 
definition line 
found 

[Caenorhabditis 
elegans] 


6e-013 


84 


AF100170.1 


Bos taurus 
major fibrous 
sheath protein 
precursor, 
mRNA, 
complete cds 


1.5 


463552 


(U05877) AF-1 
[Homo sapiens] 


0.074 


85 


Y13441 


Homo sapiens 
Rox gene, exon 
2 


0.74 


<NONE> 


<NONE> 


<NONE 

> 
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Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






86 


L46792 


Actinidia 
deliciosa clone 
AdXET-5 
xyloglucan 
endotransglycos 
ylase precursor 
(XET) mRNA, 
complete cds 


0.73 


3170252 


(AF043636) 
circumsporozoit 
e protein 
[Plasmodium 
chabaudi] 


0.001 


87 


U73489 


Drosophila 
melanogaster 
Nem (nem) 
mRNA, 
complete cds 


0.7 


3915994 


HYPOTHETIC 

AL 53.2 KD

PROTEIN IN 

PRC-PRPA 

INTERGENIC 

REGION 


3e-005 


88 


U95097 


Xenopus laevis 
mitotic 

phosphoprotein 
43 mRNA, 
partial cds 


0.68 


157272 


(L11345) DNA-
binding protein 
[Drosophila 
melanogaster] 


8.5 


89 


AF082012 


Caenorhabditis 
elegans UDP-N- 
acetylglucosami 
ne:a-3-D- 
mannoside b- 
1,2-N- 

acetylglucosami 
nyltransferase I 
(gly-14) mRNA, 
complete cds 


0.67 


2494313 


PUTATIVE 
TRANSLATIO 
N INITIATION 
FACTOR EIF- 
2B SUBUNIT 1 
(EIF-2B GDP- 
GTP 

EXCHANGE 
FACTOR) eIF-
2B, subunit 
alpha - 

Methanococcus 
jannaschii aIF-
2B, subunit 
delta (aIF2BD) 
[Methanococcus 
jannaschii] 


8.4 


90 


U04354 


Mus musculus 
ADSEVERIN 
mRNA, 
complete cds 


0.67 


4755188 


(AC007018) 
unknown protein 


8e-026 
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Genbank) 
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(BlastX vs. Non- 
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Proteins) 






91 


M68881 


S.pombe cig1+
gene, complete 
cds. 


0.67 


2078441 


(U56964) weak 
similarity to S. 
cerevisiae 
intracellular 
protein transport 
protein USO1
(SP:P25386) 


2e-030 


92 


U95097 


Xenopus laevis 
mitotic 

phosphoprotein 
43 mRNA, 
partial cds 


0.66 


2829685 


PROTEIN- 
TYROSINE 
PHOSPHATAS
E X

PRECURSOR 

(R-PTP-X) 

(PTP IA- 

2BETA) 

(PROTEIN 

TYROSINE 

PHOSPHATAS 

E-NP) (PTP- 

NP) 

>gi |1515425 
(U57345) 
protein tyrosine 
phosphatase-NP 
[Mus musculus] 


6.2 


93 


Z15056


B.subtilis genes 
spoVD, murE, 
mraY, murD 


0.66 


477124 


P3A2 DNA 
binding protein 
homolog EWG - 
fruit fly 
(Drosophila 
melanogaster) 


2.1 


94 


M86808 


Human pyruvate 
dehydrogenase 
complex 
(PDHA2) gene, 
complete cds. 


0.65 


<NONE> 


<NONE> 


<NONE 

> 


95 


J03754 


Rat plasma 
membrane 
Ca2+ ATPase- 
isoform 2 
mRNA, 
complete cds. 


0.65 


4507549 


transmembrane 
protein with 
EGF-like and 
two follistatin- 
like domains 1 
>gi |755466 


8e-006 
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Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






96 


NM_000887.1


Homo sapiens 
integrin, alpha 
X (antigen 
CD11C

emb|Y00093|H 

SP15095 

H. sapiens 

mRNA for 

leukocyte 

adhesion 

glycoprotein 

p150,95


0.64 


<NONE> 


<NONE> 


<NONE 

> 


97 


L27080 


Human 

melanocortin 5 
receptor 
(MC5R) gene, 
complete cds. 


0.64 


<NONE> 


<NONE> 


<NONE 

> 


98 


U07890 


Mus musculus 
C57BL/6J 
epidermal 
surface antigen 
(mesa) mRNA, 
complete cds. 


0.64 


<NONE> 


<NONE> 


<NONE 

> 


99 


AF079139 


Streptomyces 

venezuelae

pikCD operon, 

complete 

sequence 


0.64 


3041869 


(U96109) 
proline-rich
transcription 
factor ALX3 
[Mus musculus] 


2.8 


100 


M16140 


Chicken 
ovoinhibitor 
gene, exon 15. 


0.64 


123984 


ACROSIN 
INHIBITORS 
HA AND HB 


4e-008 


101 


NM_000887.1


Homo sapiens 
integrin, alpha 
X (antigen 
CD11C

emb|Y00093|H 
SP15095
H. sapiens 
mRNA for 
leukocyte 
adhesion 
glycoprotein 
p150,95


0.63 


<NONE> 


<NONE> 


<NONE 

> 
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(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
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Proteins) 






102 


Z17316 


Kluyveromyces 
lactis for gene 
encoding 

phosphofructoki

nase beta 
subunit 


0.63 


<NONE> 


<NONE> 


<NONE 

> 


103 


Z25470 


H. sapiens 
melanocortin 5 
receptor gene, 
complete CDS 


0.63 


<NONE> 


<NONE> 


<NONE 

> 


104 


L19954 


Bacillus subtilis 
feuA, B, and C 
genes, 3 ORFs, 
2 complete cds's 
and 5' end. 


0.63 


<NONE> 


<NONE> 


<NONE 

> 


105 


U44405 


Spiroplasma 
citri 

chromosome 
pre-inversion 
border, SPV1- 
like sequences, 
transposase 
gene, partial 
cds, adhesin-like 
protein P58 
gene, complete 
cds. 


0.63 


2499642 


SERINE/THRE 
ONINE- 
PROTEIN 
KINASE STE20 
HOMOLOG 
>gi |1737181 
(U73457) 
Cst20p [Candida 
albicans] 


7.7 
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(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






106 


Z28264 


S.cerevisiae 
chromosome XI 
reading frame 
ORF YKR039w 


0.63 


3880930 


(AL021481) 
similar to 
Phosphoglucom 
utase and 
phosphomannom 
utase 

phosphoserine; 
cDNA EST 
EMBL:D36168 
comes from this 
gene; cDNA 
EST 

EMBL:D70697 
comes from this 
gene; cDNA 
EST yk373h9.5 
comes from this 
gene; cDNA 
EST 

EMBL:T00805 


2e-014 


107 


AE001107 


Archaeoglobus 
fulgidus section 
172 of 172 of 
the complete 
genome 


0.62 


<NONE> 


<NONE> 


<NONE 
> 


108 


Z14112 


B.firmus Top A 
gene encoding 
DNA 

topoisomerase I 


0.62 


310115 


(L02530) 
Drosophila 
polarity gene 
(frizzled) 
homologue 


0.026 


109 


AF118101 


Toxoplasma 
gondii protein 
kinase 6 (tpk6) 
mRNA, 
complete cds 


0.62 


726403 


(U23175) 
similar to anion 
exchange 
protein 

[Caenorhabditis 
elegans] 


4e-018 


110 


M59743 


Rabbit cardiac 
release channel 


0.61


<NONE>


<NONE>


<NONE>


111 


M12036 


Human tyrosine 
kinase-type 
receptor (HER2) 
gene, partial 
cds. 


0.61 


61962 


(X58484) gag 
[Simian foamy 
virus] 


7.5 
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Genbank) 






Nearest Neighbor 
(BlastX vs. Non-
Redundant 
Proteins) 






112 


AF043195 


Homo sapiens 
tight junction 
protein ZO (ZO- 
2) gene, 

alternative splice 
products, 
promoter and 
exon A 


0.61 


1572629 


(U69699) 
unknown protein 
precursor [Mus 
musculus] 


7.5 


113 


U18178 


Human HLA 
class I genomic 
survey 
sequence. 


0.61 


1336688 


(S81116) 
properdin 
[guinea pigs, 
spleen, Peptide, 
470 aa] [Cavia] 


5.7 


114 


U44405 


Spiroplasma 
citri 

chromosome 
pre-inversion 
border, SPV1- 
like sequences, 
transposase 
gene, partial 
cds, adhesin-like 
protein P58 
gene, complete 
cds. 


0.61 


2827531 


(AL021633) 
hypothetical 
protein 


3.3 


115 


Z33011 


M.capricolum 
DNA for 
CONTIG 
MC008 


0.61 


3915729 


HYPERPLASTI 
C DISCS 
PROTEIN 
(HYD 
PROTEIN) 
>gi |2673887 
(L14644) 
hyperplastic 
discs protein 


0.26 


116 


NM_001429.1


Homo sapiens 
E1A binding 
protein p300 
mRNA, 
complete cds. 
> :: 

gb|I62297|I62297 Sequence 1
from patent US 
5658784 


0.61 


4204294 


(AC003027) 
lcl | prt_seq No 
definition line 
found 


5e-005 
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Nearest 
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(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






117 


Z25418 


C.familiaris 
MHC class Ib
gene (DLA-79) 
gene, complete 
CDS 


0.61 


3877493 


(Z48583) 
similar to 
ATPases 
associated with 
various cellular 
activities 
(AAA); cDNA 
EST 

EMBL:Z14623 
comes from this 
gene; cDNA 
EST 

EMBL:D75090 
comes from this 
gene; cDNA 
EST 

EMBL:D72255 
comes from this 
gene; cDNA 
EST yk200e4.5 


1e-007


118 


AB002150 


Bacillus subtilis 
DNA for FeuB, 
FeuA, YbbB, 
YbbC, YbbD, 
YbzA, YbbE, 
YbbF, YbbH, 
YbbI, YbbJ, 
YbbK, YbbL, 
YbbM, YbbP, 
complete cds 


0.6 


<NONE> 


<NONE> 


<NONE 

> 


119 


Y07786 


V.cholerae 
ORF's involved 
in 

lipopolysacchari 
de synthese 


0.6


<NONE>


<NONE>


<NONE 

> 














120 


Z17316 


Kluyveromyces 
lactis for gene 
encoding 
phosphofructoki 
nase beta 
subunit 


0.6 


<NONE> 


<NONE> 


<NONE 

> 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






121 


Z71403 


S.cerevisiae 
chromosome 
XIV reading 
frame ORF 
YNL127W 


0.6 


<NONE> 


<NONE> 


<NONE 

> 


122 


L34641 


Homo sapiens 
platelet/endothel
ial cell adhesion
molecule-1
(PECAM-1) 
gene, exon 10. 


0.6 


1147634 


(U42213) 
micronemal 
TRAP-C1 
protein homolog 


9.6 


123 


AF070572 


Homo sapiens 
clone 24778 
unknown 
mRNA 


0.6 


399034 


N- 

ACETYLMUR 

AMOYL-L- 

ALANINE 

AMIDASE 

AMIB 

PRECURSOR 
>gi| 628763| pi 
r| |S41741 N- 
acetylmuramoyl- 
L-alanine 
amidase (EC 
3.5.1.28) - 
Escherichia coli 
>gi |304914 
(L19346) N- 
acetylmuramoyl- 
L-alanine 
amidase 
[Escherichia 
coli] N- 

acetylmuramoyl- 
L-alanine

amidase II, a


2.5 


124 


X75627 


C. burnetii trxB, 
spoIIIE and serS 
genes 


0.6 


3036833 


(AJ003163) 
apsB 

[Emericella 
nidulans] 


0.28 


125 


Z99765 


Flaveria pringlei 
gdcsH gene 


0.59 


<NONE> 


<NONE> 


<NONE 

> 
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Nearest 
Neighbor 
(BlastN vs.
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 

Redundant

Proteins) 






126 


U02538 


Mycoplasma 
hyopneumoniae 
J ATCC 25934 
23S rRNA gene
partial sequence 


0.59 


<NONE> 


<NONE> 


<NONE 

> 


127 


Z71403 


S.cerevisiae 
chromosome 
XIV reading 
frame ORF 
YNL127w 


0.59 


<NONE> 


<NONE> 


<NONE 
> 


128 


X03942 


Mouse simple 
repetitive DNA 
(sqr family) 
transcript (clone 
pmlc 2) with 
conserved 
GACA/GATA 
repeats 


0.59


<NONE> 


<NONE> 


<NONE 
> 


129 


U11844 


Mus musculus 
glucose 
transporter 
(GLUT3) gene, 
exon 1 


0.59 


<NONE> 


<NONE> 


<NONE 
> 


130 


D63395 


Homo sapiens 
mRNA for 
NOTCH4, 
partial cds 


0.59 


4433616 


(AF107018) 
alpha- 

mannosidase IIx 
[Mus musculus] 


1.8 


131 


Z33011 


M.capricolum 
DNA for 
CONTIG 
MC008 


0.59 


3915729 


HYPERPLASTI 
C DISCS 
PROTEIN 
(HYD 
PROTEIN) 
>gi |2673887 
(L14644) 
hyperplastic 
discs protein 


0.27 


132 


U05670 


Haemophilus 
influenzae DL42 
Lex2A and 
Lex2B genes, 
complete cds. 


0.58 


<NONE> 


<NONE> 


<NONE 

> 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






133 


L27080 


Human 

melanocortin 5 
receptor 
(MC5R) gene, 
complete cds. 


0.58 


123984 


ACROSIN 
INHIBITORS 
HA AND HB


2e-006 


134 


AF043195 


Homo sapiens 
tight junction 
protein ZO (ZO- 
2) gene, 

alternative splice 
products, 
promoter and 
exon A 


0.57 


1572629 


(U69699) 
unknown protein 
precursor [Mus 
musculus] 


6.7 


135 


U57707 


Bos taurus 
activin receptor 
type IIB 
precursor 


0.57 


807646 


(M17294) 
unknown protein 
[Human 
herpesvirus 4] 


0.068 


136 


Z17316 


Kluyveromyces 
lactis for gene 
encoding 
phosphofructoki 
nase beta 
subunit 


0.56 


<NONE> 


<NONE> 


<NONE 

> 


137 


M21535 


Human erg 
protein (ets- 
related gene) 
mRNA, 
complete cds. 


0.56 


<NONE> 


<NONE> 


<NONE 

> 


138 


M64932 


Candida maltosa 
cycloheximide
resistance 
protein 


0.56 


3219524 


(AF069428) 
NADH 

dehydrogenase 
subunit IV 
[Alligator 
mississippiensis] 
>gi| 3367630 |e 
mb|CAA73570 
| (Y13113) 
NADH 

dehydrogenase 
subunit 4 
[Alligator 
mississippiensis] 


1.3 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






139 


AE000342 



Escherichia coli 
K-12 MG1655 
section 232 of 
400 of the 
complete 
genome 


0.56 


3874685 


(Z78539) 
Similarity to 
S.pombe 
hypothetical 
protein 
C4G8.04 
(SW:YAD4_SCHPO); cDNA
EST 

EMBL:D27846 
comes from this 
gene; cDNA 
EST 

EMBL:D27845 
comes from this 
gene; cDNA 
EST yk202h7.3 
comes from this 
gene; cDNA 
EST yk202h7.5 
come... 


0.088 


140 


Z15056


B.subtilis genes 
spoVD, murE, 
mraY, murD 


0.55 


477124 


P3A2 DNA 
binding protein 
homolog EWG - 
fruit fly 
(Drosophila 
melanogaster) 


3.7 


141 


Z58167 


H. sapiens CpG 
island DNA 
genomic Mse1
fragment, clone
30e10, forward
read

cpg30e10.ft1b


0.53 


<NONE> 


<NONE> 


<NONE 
> 


142 


M27159 


Rat potassium 
channel-Kv2 
gene, partial 
cds. 


0.53 


1850920 


(U21247) Bet 
[Human 

spumaretrovirus 

] 


0.9 


143 


M15555 


Mouse Ig 
germline V- 
kappa-24 chain 
(VK24C) gene, 
exons 1 and 2. 


0.24 


<NONE> 


<NONE> 


<NONE 

> 
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Neighbor 
(BlastN vs. 
Genbank)






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






144 


U95097 


Xenopus laevis 
mitotic 

phosphoprotein 
43 mRNA, 
partial cds 


0.24 


399109 


TRANSCRIPTI
ON FACTOR 
BF-1 (BRAIN 
FACTOR 1) 
(BF1) 

>gi| 92020 |pir 
| |IH0672 brain 
factor 1 protein 
- rat 

>gi |203135 
(M87634) BF-1 
[Rattus 
norvegicus] 


4 


145 


AJ002014 


Crypthecodinium
cohnii mRNA 
for nuclear 
protein JUS1 


0.24 


416704 


BALBIANI 
RING 

PROTEIN 3 
PRECURSOR 
balbiani ring 3 
(BR3) 

[Chironomus 
tentans] 


0.36 


146 


L35330 


Rattus 
norvegicus 
glutathione S- 
transferase Yb3 
subunit gene, 
complete cds. 


0.23 


1388158 


(U58204) 
myomesin 
[Gallus gallus] 


8.8 


147 


NM_001432.1


Homo sapiens 
epiregulin 
(EREG) mRNA 
> :: 

dbj|D30783|D3 
0783 Homo 
sapiens mRNA 
for epiregulin, 
complete cds 


0.23 


2851520 


TRANSFORMI 
NG GROWTH 
FACTOR 
ALPHA 
PRECURSOR 
(TGF-ALPHA) 
(EGF-LIKE 
TGF) (ETGF) 
(TGF TYPE 1) 
precursor - rat 
>gi |207282 
(M31076) 
transforming 
growth factor 
alpha precursor 
[Rattus 
norvegicus] 


2e-008 
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Nearest 

Neighbor
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non-
Redundant 
Proteins) 






148 


U57043 


Cebus apella 
gamma globin 
(gamma 1) gene, 
complete cds 


0.22 


<NONE> 


<NONE> 


<NONE 
> 


149 


AB023188.1 


Homo sapiens 
mRNA for 
KIAA0971 
protein, 
complete cds 


0.22 


<NONE> 


<NONE> 


<NONE 

> 


150 


M18105 


Yeast 

(S.cerevisiae) 
SST2 gene 
encoding 
desensitization 
to alpha-factor
pheromone, 
complete cds. 


0.22 


<NONE> 


<NONE> 


<NONE 

> 


151 


AJ001113 


Homo sapiens 
UBE3A gene, 
exon 16 


0.22 


3122961 


ENHANCER 
OF SPLIT 
GROUCHO- 
LIKE PROTEIN 
1 >gi |2408145 
(U18775) 
enhancer of split 
groucho 


8.5 


152 


L35330 


Rattus 
norvegicus 
glutathione S- 
transferase Yb3 
subunit gene, 
complete cds. 


0.22 


1388158 


(U58204) 
myomesin 
[Gallus gallus] 


8.1 


153 


D42042 


Human mRNA 
for KIAA0085 
gene, partial cds 


0.22 


4827063 


zinc finger 
protein 142 
(clone pHZ-49) 
>gi| 3123312 |s 
p|P52746|Z142 

HUMAN 
ZINC FINGER 
PROTEIN 142 
(KIAA0236) 
(HA4654) 
>gi| 1510147 |d 
bj|BAA13242|


6.1 
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(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






154 


L35330 


Rattus 
norvegicus 
glutathione S- 
transferase Yb3 
subunit gene, 
complete cds. 


0.22 


2853301 


(AF007194) 
mucin [Homo 
sapiens] 


1.6 


155 


Z11653 


H. sapiens DBH 
gene complex 
repeat 

polymorphism 
DNA 


0.22 


3819705 


(AL032824) 
syntaxin binding 
protein 1; sec1
family secretor 
y protein 
[Schizosaccharo 
myces pombe] 


1.2 


156 


L29063 


Candida 
albicans fatty 
acid synthase 
alpha subunit 
(FAS2) gene,
complete cds. 


0.22 


3046871 


(AB003753) 
high sulfur 
protein B2E 
[Rattus 
norvegicus]


0.32 


157 


M64865 


Horse alcohol 
dehydrogenase- 
S-isoenzyme 
mRNA, 
complete cds. 


0.22 


2213909 


(AF004874) 
latent TGF-beta 
binding protein- 
2 [Mus 
musculus] 


0.037 


158 


Y09472 


B.taurus gene 
encoding 
preprododecape 
ptide 


0.21 


2909874 


(AF047829) 
melatonin- 
related receptor 
[Ovis aries] 


7.6 


159 


Y09472 


B.taurus gene 
encoding 
preprododecape 
ptide 


0.21 


2909874 


(AF047829) 
melatonin- 
related receptor 
[Ovis aries] 


7.5 


160 


X80301 


N.tabacum axi 1 
gene 


0.21 


2832715 


(AJ003066) 
subunit beta of 
the 

mitochondrial 
fatty acid beta- 
oxydation 
multienzyme 
complex [Bos 
taurus] 


6 
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Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






161 


AF073485 


Homo sapiens 
MHC class I- 
related protein 
MR1 precursor 
(MR1) gene, 
partial cds 


0.21 


2224559 


(AB002307) 
KIAA0309 
[Homo sapiens] 


3.3 


162 


S78251 


growth hormone 
receptor 
{alternatively 
spliced, exon 
IB} [sheep, 
Merino, skeletal 
muscle, mRNA. 
Partial, 438 nt] 


0.21 


729381 


DYNAMIN-1 

(DYNAMIN 

BREDNM19) 


2 


163 


U16135 


Synechococcus 
sp. Clp protease 
proteolytic 
subunit 


0.21 


135514 


T-CELL 
RECEPTOR 
BETA CHAIN 
PRECURSOR 
precursor (ANA 
11) - rabbit 


0.02 


164 


X95601 


M.hominis lmp3 
and lmp4 genes 


0.21 


2995445 


(Y10496) CDV- 
1 protein [Mus 
musculus] 


0.005 


165 


X95601 


M.hominis lmp3 
and lmp4 genes 


0.21 


2995447 


(Y10495) CDV- 
1R protein [Mus 
musculus] 


0.005 


166 


AF124249.1 


Homo sapiens 
SH2-containing 
protein Nsp1
mRNA, 
complete cds 


0.21 


423456 


epidermal 
growth factor- 
receptor-binding 
protein GRB-4 - 
mouse 
(fragment) 


8e-010 


167 


AF030282 


Danio rerio 
homeobox 
protein Six7 
(six7) mRNA,
complete cds 


0.21 


3928083 


(AC005770) 
unknown protein 
[Arabidopsis 
thaliana] 


2e-014 


168 


X83427 


O.anatinus 
mitochondrial 
DNA, complete 
genome 


0.21 


132575 


RIBONUCLEA 
SE INHIBITOR 


3e-021 
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(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






169 


AJ001113 


Homo sapiens 
UBE3A gene, 
exon 16 


0.2 


<NONE> 


<NONE> 


<NONE 

> 


170 


AF081533.1 


Anopheles 
gambiae 
putative gram 
negative bacteria 
binding protein 
gene, complete 
cds 


0.2 


<NONE> 


<NONE> 


<NONE 

> 


171 


U70316 


Dictyostelium 
discoideum 
IonA (iona) 
gene, partial cds 


0.2 


<NONE> 


<NONE> 


<NONE 
> 


172 


AF009341 


Homo sapiens 
E6-AP 

ubiquitin-protein 
ligase 


0.2 


<NONE> 


<NONE> 


<NONE 
> 


173 


L35330 


Rattus 
norvegicus 
glutathione S- 
transferase Yb3 
subunit gene, 
complete cds. 


0.2 


3702275 


(AC005793) 
KIAA0561 
protein [A A 1- 
593] [Homo 
sapiens] 


2.5 


174 


AE000573.1 


Helicobacter 
pylori 26695 
section 51 of 
134 of the 
complete 
genome 


0.2 


3947855 


(AL034381) 
putative Golgi 
membrane 
protein 


2.5 


175 


X83230 


G.gallus 
hsp90beta gene 


0.2 


3258596 


(U95821) 

putative 

transmembrane 

GTPase 

[Drosophila 

melanogaster] 


0.81 


176 


X57157 


Chicken mRNA 
for Hsp47, heat 
shock protein 47 


0.2 


108325 


insulin-like 
growth factor- 
binding protein 
6 


0.17 
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(BlastN vs 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 

Redundant

Proteins) 






177 


M58748 


Chicken alpha- 
globin gene 
domain with 
structural matrix 
attachment sites. 


0.2 


1086863 


(U41272) 
T03G11.6 gene
product 

[Caenorhabditis 
elegans] 


4e-005 


178 


AB016815 


Anthocidaris 
crassispina 
mRNA for Src- 
type protein 
tyrosine kinase, 
complete cds 


0.2 


423456 


epidermal 
growth factor- 
receptor-binding 
protein GRB-4 - 
mouse 
(fragment) 


1e-012


179 


AF030282 


Danio rerio 
homeobox 
protein Six7 
(six7) mRNA, 
complete cds 


0.2 


3928083 


(AC005770) 
unknown protein 
[Arabidopsis 
thaliana] 


3e-014 


180 


AL035559 


Streptomyces 
coelicolor 
cosmid 9F2 


0.2 


2088714 


(AF003139) 
strong similarity 
to NADPH 
oxidases; partial 
CDS, the gene 
begins in the 

neighboring

clone 


3e-022 


181 


S79641 


SDH = succinate 
dehydrogenase
flavoprotein 
subunit Mutant, 
387 nt] 


0.2 


4755188 


(AC007018) 

unknown protein


2e-022 


182 


X75383 


H. sapiens 
mRNA for 
TFIIA-alpha 


0.19 


<NONE> 


<NONE> 


<NONE 

> 


183 


U53901 


Hippopotamus 
amphibius b- 
casein gene, 
exon 7, partial 
cds 


0.19 


<NONE> 


<NONE> 


<NONE 
> 


184 


J05265 


Mouse 
interferon 
gamma receptor 
mRNA, 
complete cds. 


0.19 


77356 


hypothetical 
70K protein - 
eggplant mosaic 
virus 


0.0005 
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(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






185 


U72353 


Rattus 
norvegicus 
lamin B1
mRNA, 
complete cds 


0.19 


3880857 


(AL031633) 
cDNA EST 
yk404d1.5
comes from this 
gene; cDNA 
EST yk404d1.3
comes from this 
gene 


2e-006 


186 


AB016815 


Anthocidaris 
crassispina 
mRNA for Src- 
type protein 
tyrosine kinase, 
complete cds 


0.19 


3930217 


(AF047487) 
Nck-2 [Homo 
sapiens] 


2e-007 


187 


D10911 


Mus musculus 
DNA for MS2 
protein, 


0.19 


2662366 


(D86332) 
membrane type- 
2 matrix 

metalloproteinas

e [Mus 
musculus] 


5e-011 


188 


AB015345 


Homo sapiens 
HRIHFB2216 
mRNA, partial 
cds 


0.075 


3877417 


(Z66564) 
similar to anion 
exchange 
protein 


6.4 


189 


AF086410 


Homo sapiens 
full length insert 
cDNA clone 
ZD77B03 


0.075 


3023371 


PHEROMONE 
B BETA 1 
RECEPTOR 


4.9 


190 


K02024 


Human T-cell
lymphotropic
virus type II env
gene encoding 
envelope 
glycoprotein, 
complete cds. 


0.075 


2791527 


(AL021246) 
PE_PGRS 
[Mycobacterium 
tuberculosis] 


0.11 
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Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






191 


M10188 


X.laevis 
mitochondrial 
DNA containing 
the D-loop, and 
the 12S rRNA, 
apocytochrome 
b, Glu-tRNA, 
Thr-tRNA, Pro- 
tRNA and Phe- 
tRNA genes. 


0.074 


4753163 


huntingtin 
DISEASE 
PROTEIN) (HD 
PROTEIN) 
>gi |454415 
(L12392) 
Huntington's 
Disease protein 
[Homo sapiens] 


2.8 


192 


X85525 


G.gallus AG 
repeat region 
(GgaMU130) 


0.073 


984339 


(U20966) Rev 
[Simian

immunodeficien 
cy virus] 


3.6 


193 


AJ238394.1 


Homo sapiens 
AML2 gene 
(partial) 


0.07 


4240219 


(AB020672) 
KIAA0865 
protein [Homo 
sapiens] 


2 


194 


AF039704 


Homo sapiens 

lysosomal 

pepstatin 

insensitive 

protease (CLN2) 

gene, complete 

cds 


0.069 


2894106 


(Z78279) 
Collagen alpha 1 
[Rattus 
norvegicus] 


0.39 


195 


K02024 


Human T-cell 
lymphotropic 
virus type II env
gene encoding 
envelope 
glycoprotein, 
complete cds. 


0.068 


4504857 


potassium 
intermediate/sm 
all conductance 
calcium- 
activated 
channel, 
subfamily N, 
member 3 
>gi |3309531 
(AF031815) 
calcium- 
activated 
potassium 
channel [Homo 
sapiens] 


0.5 
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Table 2 





Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






196 


Z60719 


H. sapiens CpG 
island DNA 
genomic Mse1
fragment, clone
33a11, forward
read

cpg33a11.ft1m


0.068 


4826874 


nucleoporin 
214kD (CAIN) 
PROTEIN 
NUP214 
(NUCLEOPORI 
N NUP214) 
(214 KD 
NUCLEOPORI 
N) transforming 
protein (can) - 
human sapiens] 


0.044 


197 


AF053994 


Lycopersicon 
esculentum 
Hcr2-0A (Hcr2-
0A) gene,
complete cds 


0.068 


2842699 


PUTATIVE 

UBIQUITIN 

CARBOXYL- 

TERMINAL 

HYDROLASE 

C6G9.08 

(UBIQUITIN

THIOLESTERA 

SE) 

(UBIQUITIN- 
SPECIFIC 
PROCESSING 
PROTEASE) 


9e-009 


198 


AJ233650.1 


Equus caballus 
endogenous 
retroviral 
sequence ERV- 
L pol gene, 
clone ERV-L 
Horse 1 


0.067 


<NONE> 


<NONE> 


<NONE 

> 


199 


M10188 


X.laevis 
mitochondrial 
DNA containing 
the D-loop, and
the 12S rRNA,
apocytochrome 
b, Glu-tRNA, 
Thr-tRNA, Pro- 
tRNA and Phe- 
tRNA genes. 


0.067 


4753163 


huntingtin 
DISEASE 
PROTEIN) (HD 
PROTEIN) 

>gi |454415

(L12392) 
Huntington's 
Disease protein 
[Homo sapiens] 


2.5 
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Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






200 


U14646


Murine hepatitis 
virus Y strain S 
glycoprotein 
gene, complete 
cds. 


0.067 


3880930 


(AL021481) 
similar to 
Phosphoglucom 
utase and 
phosphomannom 
utase 

phosphoserine; 
cDNA EST 
EMBL:D36168 
comes from this 
gene; cDNA 
EST 

EMBL:D70697 
comes from this 
gene; cDNA 
EST yk373h9.5 
comes from this 
gene; cDNA 

EST

EMBL:T00805 


1e-019


201 


X15373 


Mouse 
cerebellum 
mRNA for P400 
protein 


0.066 


164507 


(M81771) 
immunoglobulin 
gamma-chain 
[Sus scrofa] 


9.4 


202 


AF086410 


Homo sapiens 
full length insert 
cDNA clone
ZD77B03 


0.066 


3023371 


PHEROMONE 
B BETA 1
RECEPTOR


4.2 


203 


AL034492 


Streptomyces 
coelicolor 
cosmid 6C5 


0.066 


3800951 


(AF100657) No 
definition line 
found 

[Caenorhabditis 
elegans] 


3e-015 


204 


L13377 


Staphylococcus 
aureus 
enterotoxin 
gene, 3' end. 


0.065 


<NONE> 


<NONE> 


<NONE 

> 


205 


U83478 


Thelephoraceae 
sp. Taylor #13' 
ITS1, 5.8S 
ribosomal RNA 
gene, and ITS2, 
complete 
sequence 


0.065 


3877335 


(Z92786) 
predicted using 
Genefinder 


9.1 
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Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non-
Redundant 
Proteins) 






206 


AJ002014 


Crypthecodinium
cohnii mRNA 
for nuclear 
protein JUS1 


0.065 


1213283 


(U40576) SIM2 
[Mus musculus] 


0.47 


207 


AB016804 


Aloe 

arborescens 
mRNA for 
NADP-malic 
enzyme,
complete cds 


0.065 


2832777 


(AL021086) 
/prediction = (me 
thod:; comes 
from the 5' 
UTR 

[Drosophila 
melanogaster] 


5e-036 


208 


AJ002014 


Crypthecodinium
cohnii mRNA 
for nuclear 
protein JUS1 


0.063 


1213283 


(U40576) SIM2 
[Mus musculus] 


0.45 


209 


AB023143.1


Homo sapiens 
mRNA for 
KIAA0926 
protein, 
complete cds 


0.024 


132575 


RIBONUCLEA 
SE INHIBITOR 


8e-026 


210 


U72966 


Human 
hepatocyte 
nuclear factor 4- 
alpha gene, 
exon 7 


0.022 


<NONE> 


<NONE> 


<NONE>


211 


X02801 


Mouse gene for 
glial fibrillary
acidic protein


0.022 


2231607 


(U85917) nef 
protein [Human
immunodeficien 
cy virus type 1] 


7 


212 


AF017636 


Mesocricetus 
auratus 3-keto- 
steroid reductase 


0.022 


2723362 


(AF023459) 
lustrin A 
[Haliotis 
rufescens] 


0.097 


213 


Z36879 


F.pringlei 
gdcsPA gene for 
P-protein of the 
glycine cleavage 
system 


0.008 


<NONE> 


<NONE> 


<NONE>


214 


X73150 


P. sativum 
GapCl gene 


0.008 


1572629 


(U69699) 
unknown protein 
precursor [Mus 
musculus] 


8.6 
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Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






215 


AJ239031.1 


Homo sapiens 
LSS gene, 
partial, exons 
22, 23 and 
joined CDS 


0.008 


4508019 


zinc finger 
protein 231 
protein [Homo 
sapiens] 


0.01 


216 


U76602 


Human 180 kDa 
bullous 
pemphigoid 
antigen 2/type 
XVII collagen 
(BPAG2/COL17 
A1) gene, exons
49, 50, 51 and 
52 


0.007 


3170252 


(AF043636) 
circumsporozoit 
e protein 
[Plasmodium 
chabaudi] 


0.0001 


217 


M11283


Aplysia 
californica 
FMRFamide 
mRNA, partial 
cds, clone 
FMRF-2. 


0.007 


3874685 


(Z78539) 
Similarity to 
S.pombe 
hypothetical 
protein 
C4G8.04 
(SW:YAD4_SC
HPO); cDNA 
EST 

EMBL:D27846 
comes from this 
gene; cDNA 
EST 

EMBL:D27845 
comes from this 
gene; cDNA 
EST yk202h7.3 
comes from this 
gene; cDNA 
EST yk202h7.5 
come... 


9e-013 


218 


J03998 


P. falciparum 
glutamic acid- 
rich protein 
gene, complete
cds. 


0.003 


<NONE> 


<NONE> 


<NONE>


219 


Z23143 


M.musculus 
ALK-6 mRNA, 
complete CDS 


0.002 


2393890 


(AF006064) 
protein kinase 
homolog 
[Fowlpox virus] 


1e-011
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Table 2 





Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






220 


AB007914 


Homo sapiens 
mRNA for 
KIAA0445 
protein, 
complete cds 


0.001 


2136964 


cysteine-rich 
hair keratin 
associated 
protein - rabbit 
> gi| 510541 |e 
mb|CAA56339 
| (X80035) 
cysteine rich 
hair keratin 
associated 
protein 


1.9 


221 


AB012105 


Brassica rapa 
mRNA for 
SLG45, 
complete cds 


0.0008 


3687246 


(AC005169) 

putative 

suppressor 

protein 

[Arabidopsis 

thaliana] 


5.5 


222 


L41608 


Methylobacteriu 
m extorquens 
(clone pDN9, 
HINDIIIAB) 
mxaS gene 3' 
end, mxaA, 
mxaC, mxaK, 
mxaL and mxaD 
genes, complete 
cds. 


0.0008 


3024235 


NERVOUS- 
SYSTEM 
SPECIFIC 
OCTAMER- 
BINDING 
TRANSCRIPTI 
ON FACTOR 
N-OCT 3 
PROTEIN) 


5.1 


223 


AB007914 


Homo sapiens 
mRNA for 
KIAA0445 
protein, 
complete cds 


0.0008 


2136964 


cysteine-rich 
hair keratin 
associated 
protein - rabbit
> gi| 510541 |e
mb|CAA56339
| (X80035)
cysteine rich 
hair keratin 
associated 
protein 


2.5 


224 


AC002293 


Genomic 
sequence from 
Human 9q34, 
complete 
sequence [Homo 
sapiens] 


0.0008 


2789557 


(AF034316) 
MHC class I 
antigen [Triakis 
scyllium] 
scyllium] 


0.0002 
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Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






225 


L16013 


Rattus 

like gene 
sequence 


9e-005 


<NONE> 


<NONE> 


<NONE>


226 


AF148512.1 


Homo sapiens 
hexokinase II 
gene, promoter 


9e-005 


<NONE> 


<NONE> 


<NONE>


227 


U94776 


Human muscle 
glycogen 
phosphorylase 
(PYGM) gene, 
exons 6 through 
17 


9e-005 


4759138 


solute carrier 
family 7 
transporter 3 
[Homo sapiens] 


5.4 


228 


X56030 


H. sapiens IAPP 
gene for 
amyloid 
polypeptide, 
exon 1 


1e-005


<NONE> 


<NONE> 


<NONE>


229 


U36515 


Human CT 
microsatellite, 
clone GM5927- 
CT-2-3, from 
the tandemly 
repeated genes 
encoding U2 
small nuclear 
RNA (RNU2 
locus) 


4e-007 


2435616 


(AF026215) No 
definition line 
found 

[Caenorhabditis 
elegans] 


0.85 


230 


AB011119 


Homo sapiens 
mRNA for 
KIAA0547 
protein, 
complete cds 


4e-007 


4758508 


airway trypsin- 
like protease 
protease [Homo 
sapiens] 


3e-031 


231 


NM_000521.1


Homo sapiens 
hexosaminidase 
B (beta 
polypeptide) 
(HEXB) mRNA 


5e-008 


2119379 


slow muscle 
troponin T - 
chicken

[Gallus gallus] 


2.8 


232 


X13895 


Human serum 
amyloid A 
(GSAA1) gene, 
complete cds 


4e-008 


699405 


(U18682) novel 
antigen receptor 
[Ginglymostoma 
cirratum] 


7.7 
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Table 2 






Nearest 
Neighbor
(BlastN vs. 
Genbank) 






Nearest Neighbor 

(BlastX vs. Non-

Redundant 
Proteins) 






233 


AB009288.1 


Homo sapiens 
mRNA for N- 
copine, 
complete cds 


4e-008 


4520342 


(AB008893) N- 
copine [Mus 
musculus] 


3e-006 


234 


AB011119 


Homo sapiens 
mRNA for 
KIAA0547 
protein, 
complete cds 


4e-008 


4758508 


airway trypsin- 
like protease 
protease [Homo 
sapiens] 


1e-028


235 


X13895 


Human serum 
amyloid A 
(GSAA1) gene, 
complete cds 


5e-009 


699405 


(U18682) novel 
antigen receptor 
[Ginglymostoma 
cirratum] 


7.8 


236 


X13895 


Human serum 
amyloid A 
(GSAA1) gene, 
complete cds 


2e-009 


699405 


(U18682) novel 
antigen receptor 
[Ginglymostoma 
cirratum] 


7.2 


237 


U64997 


Bos taurus 
ribonuclease K6 
gene, partial cds 


2e-009 


3914810 


RIBONUCLEA 
SE K6

PRECURSOR 
(RNASE K6) 
>gi |2745760 
(AF037086) 
ribonuclease k6 
precursor 


3e-018 


238 


J02635 


Rat liver alpha- 
2-macroglobulin 
mRNA, 
complete cds. 


2e-009 


112913 


ALPHA-2- 

MACROGLOB 

ULIN 

PRECURSOR 
precursor - rat 
>gi |202592 
(J02635) 
prealpha-2-
macroglobulin

[Rattus 
norvegicus] 


4e-019 


239 


Z78141 


M.musculus 
partial cochlear 
mRNA (clone 
29C9) 


5e-010 


3219569 


(AL023893) 
/prediction = (me 
thod:; 


4e-009 
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Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






240 


AF060917 


Gambusia 
affinis 

microsatellite 
Gafu6 


2e-010 


3874618 


(Z48241) 
similar to coiled 
coil domains; 
cDNA EST 
yk302g12.5
comes from this 
gene; cDNA 
EST 

yk365d10.5
comes from this 
gene; cDNA 
EST yk461c1.5
comes from this 
gene 

[Caenorhabditis 
elegans] coil 
domains; cDNA 
EST 

comes from this 
gene; cDNA 
EST 


0.096 


241 


U68138 


Human PSD-95 
mRNA, partial 
cds 


2e-010 


4521241 


(AB024927) 
CsENDO-3 
[Ciona savignyi] 


2e-022 


242 


U88827 


Aotus trivirgatus 
ribonuclease 
precursor gene, 
complete cds 


6e-011 


3914810 


RIBONUCLEA 
SE K6

PRECURSOR
(RNASE K6)
>gi |2745760
(AF037086) 
ribonuclease k6 
precursor 


1e-016


243 


AF045573 


Mus musculus 
FLI-LRR 
associated 
protein- 1 
mRNA, 
complete cds 


2e-012 


3025718 


(AF045573) 
FLI-LRR 
associated 
protein- 1 [Mus 
musculus] 


3e-016 
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Table 2 





Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






244 


NM_001365.1


Homo sapiens 
discs, large 
(Drosophila) 
homolog 4 
(DLG4) mRNA 
> :: 

gb|U83192|HS 
U83192 Homo 
sapiens post- 
synaptic density 
protein 95 
(PSD95) 
mRNA, 
complete cds 


2e-012 


4521241 


(AB024927) 
CsENDO-3 
[Ciona savignyi] 


5e-020 


245 


U28049 


Human TBX2 
(TXB2) mRNA, 
complete cds. 


7e-013 


2501115 


TBX2 

PROTEIN (T- 
BOX PROTEIN 

2) 


2e-011 


246 


M23404 


Chicken 
erythrocyte 
anion transport 

protein (band3)

mRNA, 
complete cds. 


2e-013 


726403 


(U23175) 
similar to anion 
exchange 
protein

[Caenorhabditis 
elegans] 


1e-025


247 


AF005963 


Homo sapiens 
XY homologous 

region, partial

sequence 


1e-014


104270 


Ig heavy chain - 
clawed frog 


1.9 


248 


M29863 


Human farnesyl 
pyrophosphate 
synthetase 
mRNA 


9e-015 


182405 


(M29863) 
farnesyl 
pyrophosphate 
synthetase 
[Homo sapiens] 


0.005 


249 


D28126 


Human gene for 
ATP synthase 
alpha subunit, 
complete cds 
(exon 1 to 12) 


3e-015 


<NONE> 


<NONE> 


<NONE>
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Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






250 


Z80150 


H. sapiens 
CACNL1A4 
gene, exons 41 
and 42 > :: 
emb|A70716.1| 
A70716 
Sequence 37 
from Patent 
WO9813490


3e-015 


3387914 


(AF070550) 
cotel [Homo 
sapiens] 


3.5 


251 


U28049 


Human TBX2 
(TXB2) mRNA, 


4e-016 


2501116 


TBX2 

PROTEIN (T- 
BOX PROTEIN 
2) tbx gene 
[Mus musculus] 


6e-009 


252 


U31629 


Mus musculus 
C2C12 unknown
mRNA, partial 
cds. 


1e-017


3024998 


HYPOTHETIC 
AL HEART 
PROTEIN 


3e-017 


253 


J05262 


Human farnesyl 
pyrophosphate 
synthetase 
mRNA, 
complete cds. 


1e-018


182405 


(M29863) 
farnesyl 
pyrophosphate 
synthetase 
[Homo sapiens] 


0.0001 


254 


D28126 


Human gene for 
ATP synthase 
alpha subunit, 
complete cds 
(exon 1 to 12) 


5e-019 


<NONE> 


<NONE> 


<NONE>


255 


D28126 


Human gene for 
ATP synthase 
alpha subunit, 
complete cds 
(exon 1 to 12) 


5e-019 


3219984 


HYPOTHETIC 

AL PROTEIN 

MJ1597.1 

region 

MJ1597.1 

[Methanococcus 

jannaschii] 


5.7 
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Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






256 


NM_004587.1


Homo sapiens 
ribosome 
binding protein 
1 (dog 180kD 
homolog) 

(RRBP1)

mRNA > :: 
gb|AF006751| 
AF006751 
Homo sapiens 
ES/130 mRNA, 
complete cds 


2e-019 


4759056 


ribosome 
binding protein 
1 (dog 180kD 
homolog) 
>gi |3299885 

(AF006751)
ES/130 [Homo 
sapiens] 


0.004 


257 


U89915 


Mus musculus 
junctional 
adhesion 
molecule (Jam) 
mRNA, 
complete cds 


5e-020 


3462455 


(U89915) 
junctional 
adhesion 
molecule [Mus 
musculus] 


2e-005 


258 


AF045573 


Mus musculus 
FLI-LRR 
associated 
protein- 1 
mRNA, 
complete cds 


5e-020 


3025718 


(AF045573) 
FLI-LRR 
associated 
protein- 1 [Mus 
musculus] 


9e-025 


259 


NM_004587.1


Homo sapiens 
ribosome 
binding protein 
1 (dog 180kD 
homolog) 

(RRBP1)

mRNA > ::
gb|AF006751|
AF006751 
Homo sapiens 
ES/130 mRNA, 
complete cds 


2e-020 


4759056 


ribosome 
binding protein 
1 (dog 180kD 
homolog) 
>gi |3299885 

(AF006751)

ES/130 [Homo 
sapiens] 


0.0008 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






260 


AF051098 


Mus musculus 
seven 

transmembrane 
domain orphan 
receptor mRNA, 
complete cds 


2e-021 


3858883 


(U67056) 
myosin I heavy 
chain kinase 
[Acanthamoeba 
castellanii]
>gi |4206769
(AF104910) 
myosin I heavy 
chain kinase 
[Acanthamoeba 
castellanii] 


0.002 


261 


AF051098 


Mus musculus 
seven 

transmembrane 
domain orphan 
receptor mRNA, 
complete cds 


2e-021 


3858883 


(U67056) 
myosin I heavy 
chain kinase 
[Acanthamoeba 
castellanii] 
>gi| 4206769 
(AF104910) 
myosin I heavy 
chain kinase 
[Acanthamoeba 
castellanii] 


0.001 


262 


M13519 


Human N- 
acetyl-beta- 
glucosaminidase 
(HEXB) 
mRNA, 3' end.


2e-021 


4504373 


hexosaminidase 
B (beta 
polypeptide) 
>gi| 123081 |sp 
|P07686|HEXB 
HUMAN 
BETA- 

HEXOSAMINI 
DASE BETA 
CHAIN 
PRECURSOR 
beta-N- 

acetylhexosamin 
idase (EC 
3.2.1.52) beta 
chain - human 
>gi |386770 
(M23294) beta- 
hexosaminidase 
beta-subunit 
[Homo sapiens] 


2e-007 
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Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






263 


Z81014 


Human DNA 
sequence from 
cosmid U65A4, 
between 
markers 
DXS366 and 
DXS87 on 

chromosome X 



2e-022 


<NONE> 


<NONE> 


<NONE>


264 


AF147311.1 


Homo sapiens 
full length insert 
cDNA clone 
YA82F10 


2e-022 


3875904 


(Z70207) 
predicted using 
Genefinder; 
similar to 
collagen; cDNA 
EST 

EMBL:D65905 
comes from this 
gene; cDNA 
EST 

EMBL:D65858 
comes from this 
gene; cDNA 
EST 

EMBL:D69306 
comes from this 
gene; cDNA 
EST 

EMBL:D65755 
comes from this 
gen... 


0.07 


265 


AF037088 


Gorilla gorilla 
ribonuclease k6 
precursor, gene, 
complete cds 


9e-024 


3914791 


RIBONUCLEA 
SE K6

PRECURSOR 
(RNASE K6) 
>gi |2745752 
(AF037082) 
ribonuclease k6 
precursor 


3e-019 


266 


Z81014 


Human DNA 
sequence from 
cosmid U65A4, 
between 
markers 
DXS366 and 
DXS87 on 
chromosome X 


8e-024 


<NONE> 


<NONE> 


<NONE>






Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 




















267 


AF037088 


Gorilla gorilla 
ribonuclease k6 
precursor, gene, 
complete cds 


9e-025 


3914810 


RIBONUCLEA 
SE K6

PRECURSOR 
(RNASE K6) 
>gi |2745760 
(AF037086) 
ribonuclease k6 
precursor 


4e-018 


268 


AF147311.1 


Homo sapiens 
full length insert 
cDNA clone 
YA82F10 


1e-026


131413 


PULMONARY 
SURFACTANT 
-ASSOCIATED 
PROTEIN A 
PRECURSOR 
(SP-A) (PSP-A) 
(PSAP) 
precursor - 
rabbit

>gi |165706

(J03542) 

apoprotein of 

surfactant 

[Oryctolagus 

cuniculus] 


0.059 


269 


Z46786 


D.melanogaster 
mRNA for 
acetyl-CoA 
synthetase 


1e-027


1079042 


acetyl-CoA 
synthetase - fruit
fly 


4e-025 


270 


NM_004039.1


Homo sapiens 
annexin II
(lipocortin II) 
for lipocortin II, 
complete cds 


4e-028 


450448 


(M33322) 
calpactin I 
heavy chain 
[Mus musculus] 


0.1 


271 


X53064 


Homo sapiens 
SPRR2A gene 
encoding small 
proline rich 
protein 


1e-028


134846 


SMALL 

PROLINE- 

RICH 

PROTEIN II 
rich protein 
[Homo sapiens] 


0.005 
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Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






272 


M29863 


Human farnesyl 
pyrophosphate 
synthetase 
mRNA 


1e-028


4503685 


farnesyl 

diphosphate 

synthase 

dimethylallyltra 

nstransferase, 

geranyltranstran 

sferase) bp313 

to bp 1374 is 

almost identical 

to human 

farnesyl 

pyrophosphate 

synthetase 

mRNA. [Homo 

sapiens] 


2e-008 


273 


Z18950 


H. sapiens genes 
for S100E 
calcium binding 
protein, CAPL, 
and S100D 
calcium binding 
protein EF- 
Hand patent US 
5789248 


5e-029 


2493898 


DOPAMINE- 
BETA- 

MONOOXYGE 
NASE 

PRECURSOR 
(DOPAMINE 
BETA- 

HYDROXYLA 
SE) (DBH) 
1.14.17.1) 
precursor - 
mouse 

>gi| 260873 |bb 
s |119249 621 
aa] [Mus sp.] 


1.4 


274 


M19481 


Human 

follistatin gene, 
exon 6. 


5e-030 


<NONE> 


<NONE> 


<NONE>
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Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






275 


AF007155 


Homo sapiens 
clone 23763 
unknown 
mRNA, partial 
cds 


2e-032 


4502641 


chemokine (C- 
C) receptor 7 
TYPE 7 
PRECURSOR 
(C-C CKR-7) 
(CC-CKR-7) 
(CCR-7) (MIP-3 
BETA 

RECEPTOR) 
(EBV- 

INDUCED G 
PROTEIN- 
COUPLED 
RECEPTOR 1) 
(EBI1) (BLR2) 
>gi| 1082381 |p 
ir| |B55735 
lymphocyte- 
specific G- 
protein-coupled 
receptor EBI1 - 
human 
>gi |468316 
(L3158 


1.6 


276 


M99624


Human 
epidermal 
growth factor 
receptor-related 
gene, 5' end. 


8e-034 


294845 


(L13655) 

membrane 

protein 

[Saccharum 

hybrid cultivar 

H65-7052] 


9e-014 


277 


U49082 


Human 
transporter 
protein (g17)
mRNA, 
complete cds 


8e-035 


1840045 


(U49082) 
transporter 
protein [Homo 
sapiens] 


1e-014
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Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






278 


D50369 


Homo sapiens 
mRNA for low 
molecular mass 
ubiquinone- 
binding protein, 
complete cds 


9e-036 


3024781 


UBIQUINOL- 

CYTOCHROM 

EC 

REDUCTASE 
COMPLEX 
UBIQUINONE- 
BINDING 
PROTEIN QP- 
C PROTEIN) 
(COMPLEX III 
SUBUNIT VII) 
ubiquinone- 
binding protein 
[Homo sapiens] 


0.0002 


279 


AF086313 


Homo sapiens 
full length insert 
cDNA clone 
ZD52B10 


9e-036 


2832777 


(AL021086) 
/prediction = (me 
thod:; comes 
from the 5'
UTR 

[Drosophila 
melanogaster] 


1e-039


280 


NM_004074.1


Homo sapiens 
cytochrome c 
oxidase subunit 
VIII (COX8), 
nuclear gene 
encoding 
mitochondrial 
protein, mRNA 
> :: 

gb|J04823|HU 
MCOX8A 
Human 
cytochrome c 
oxidase subunit 
VIII (COX8) 
mRNA, 
complete cds. 


1e-038


2499854 


PROBABLE 
PEPTIDASE

Y4SO 

>gi |2182630 


2 














281 


AB024436.1 


Homo sapiens 
mRNA for beta- 
1,4- 

galactosyltransfe 
rase IV, 
complete cds 


2e-041 


3132900 


(AF038662) 
beta- 1,4- 
galactosyltransfe 
rase [Homo 
sapiens] beta- 
1,4- 

galactosyltransfe


4e-016 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 
















galactosyltransfe 
rase IV [Homo 
sapiens] 




282 


AF057734 


Homo sapiens 
17-beta- 
hydroxysteroid 
dehydrogenase 
IV (HSD17B4) 
gene, exon 16 


2e-043 


2842416 


(AL008730) 
dJ487J7.1.1 
(putative protein 
dJ487J7.1 
isoform 1) 
[Homo sapiens] 


3e-062 


283 


Z69650.1 


Human DNA 
sequence from 
cosmid L69F7B, 
Huntington's 
Disease Region, 
chromosome 
4p16.3 contains
Huntington 
Disease (HD) 
gene 


2e-044 


1872200 


(U22376) 
alternatively 
spliced product 
using exon 13 A 


1e-008


284 


NM_003938.1


Homo sapiens 
adaptin, delta 
(ADTD) mRNA 

> :: 

gb|U91930|HS
U91930 Homo 
sapiens AP-3 
complex delta 
subunit mRNA, 
complete cds 


2e-044 


3478639 


(AC005545) 
delta-adaptin, 
partial CDS 
[Homo sapiens] 


3e-016 


285 


AF026029 


Homo sapiens 
poly(A) binding 
protein II 
(PABP2) gene,
complete cds 


8e-045 


1916930 


(U88570) 
CREB-binding 
protein homolog 
[Drosophila 
melanogaster] 


7.6 


286 


AB006622 


Homo sapiens 
mRNA for 
KIAA0284 
gene, partial cds 


1e-045


73404 


E2 protein - 
human 

papillomavirus 
type 5 


0.11 
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Table 2 






Nearest
Neighbor
(BlastN vs. 
Genbank) 






Nearest Neighbor
(BlastX vs. Non-
Redundant 
Proteins) 






287 


U90918 


Human clone 
23654 mRNA 
sequence 


1e-048


3877568 


(Z70208) 
similar to 
collagen 


0.042 


288 


AB006622 


Homo sapiens 
mRNA for 
KIAA0284 
gene, partial cds 


1e-049


73404 


E2 protein - 
human 

papillomavirus 
type 5 


0.11 


289 


AL049258.1 


Homo sapiens 
mRNA; cDNA 
DKFZp564E173 
(from clone 
DKFZp564E173)


1e-050


<NONE> 


<NONE> 


<NONE>


290 


AF022367 


Homo sapiens 
beta- 1,4- 
galactosyltransfe 
rase mRNA, 
complete cds 


5e-051 


3132900 


(AF038662) 
beta- 1,4- 
galactosyltransfe 
rase [Homo 
sapiens] beta- 
1,4- 

galactosyltransfe 
rase IV [Homo 
sapiens] 


6e-019 














291 


AF057734 


Homo sapiens 
17-beta- 
hydroxysteroid 
dehydrogenase 
IV (HSD17B4) 
gene, exon 16 


7e-053 


2842416 


(AL008730) 
dJ487J7.1.1 
(putative protein 
dJ487J7.1 
isoform 1) 
[Homo sapiens] 


6e-055 


292 


AF097709 


Homo sapiens 
serine protease 
(PRSS11) 
mRNA, partial 
cds 


8e-055 


4506141 


protease, serine, 
11 (IGF 

binding)
>gi |1513059 |d
bj|BAA13322|
(D87258) serin 
protease with 
IGF-binding 
motif [Homo 
sapiens] 
protease, 
PRSS11 [Homo 
sapiens] 


2e-017 
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Table 2 






Nearest 
Neighbor
(BlastN vs.
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






293 


U31629 


Mus musculus 
C2C12 unknown 
mRNA, partial 
cds. 


9e-057 


3025215 


HYPOTHETIC 

AL 81.0 KD

PROTEIN 

C35D10.4 IN 

CHROMOSOM 

E III

>gi |2146877 |p
ir| |S72572
probable ABC1 
protein homolog 
- Caenorhabditis 
elegans protein 
(Swiss-Prot 
Acc: P27697) 
[Caenorhabditis 
elegans] 


5e-033 


294 


AB006622 


Homo sapiens 
mRNA for 
KIAA0284 
gene, partial cds 


8e-057 


73404 


E2 protein - 
human 

papillomavirus 
type 5 


1.7 


295 


AF025439 


Homo sapiens 
Opa-interacting 
protein OIP3 
mRNA, partial 
cds 


4e-059 


<NONE> 


<NONE> 


<NONE>


296 


M99624 


Human 
epidermal 
growth factor 
receptor-related 
gene, 5' end. 


1e-060


123364 


SEGMENTATI 
ON PROTEIN 
EVEN- 
SKIPPED fly 
(Drosophila sp.) 
>gi| 157387 
(M14767) even-
skipped gene 
[Drosophila 
melanogaster] 


5.3 


297 


AF045573 


Mus musculus 
FLI-LRR
associated 
protein- 1 
mRNA, 
complete cds 


5e-061 


3025718 


(AF045573) 
FLI-LRR 
associated 
protein- 1 [Mus 
musculus] 


7e-029 


298 


AB006622 


Homo sapiens 
mRNA for 
KIAA0284 
gene, partial cds 


2e-062 


2119133 


ribosomal 
protein S17 - cat
(fragment) 
musculus] 


2e-015 
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Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






299 


M30702 


Human 
amphiregulin 
(AR) gene, exon 
5, clones 
lambda- 
ARH(6,12). 


2e-063 


4502199 


amphiregulin 
(schwannoma- 
derived growth 
factor)

>gi| 113754 |sp
|P15514|AMP
R_HUMAN
AMPHIREGUL 
IN 

PRECURSOR 
(AR) 

(COLORECTU 
M CELL- 
DERIVED 
GROWTH 
FACTOR) 
(CRDGF)
>gi| 107391 |pi
r| |A34702 
amphiregulin 
precursor - 
human 
>gi |178890 
(M30703) 
amphiregulin 
[Homo sapien 


0.0002 


300 


L38847 


Mus musculus
hepatoma 
transmembrane 
kinase ligand 
Sequence 1 from 
patent US 
5624899 


6e-064 


3861228 


(AJ235272) 
unknown 
[Rickettsia 
prowazekii] 


2.9 


301 


L38847 


Mus musculus
hepatoma 
transmembrane 
kinase ligand 
Sequence 1 from 
patent US 
5624899 


6e-064 


3861228 


(AJ235272) 
unknown 
[Rickettsia 
prowazekii] 


2.9 


302 


Z78141 


M. musculus
partial cochlear 
mRNA (clone 
29C9) 


8e-066 


1490324 


(Z78141) 
unknown [Mus 
musculus]


8e-019 
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Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






303 


X12650 


Mus musculus 
gene for beta- 

tropomyosin


2e-072 


833602 


(X54277) 
cardiac 

tropomyosin

[Coturnix 
coturnix] 


7e-022 


304 


M87635 


Mouse beta- 
tropomyosin 2 
mRNA, 
complete cds. 


2e-084 


1216293 


(L35239) 
cardiac 
tropomyosin 
[Xenopus laevis] 


5e-019 


305 


M13364


Rabbit calcium- 
dependent 
protease, small 
subunit mRNA, 
complete cds. 


2e-084 


115611 



CALCIUM- 
DEPENDENT 
PROTEASE, 
SMALL 
NEUTRAL 
PROTEINASE) 
(CANP)
>gi| 108563 |pi
r| |A34466 
calpain (EC 
3.4.22.17) II 
light chain - 

bovine

3.4.22.17) [Bos 
taurus] 


1e-058


306 


M87635 


Mouse beta- 
tropomyosin 2 
mRNA, 
complete cds. 


3e-088 


1216293 


(L35239) 
cardiac 
tropomyosin 
[Xenopus laevis] 


9e-028 


307 


M87635 


Mouse beta- 
tropomyosin 2
mRNA, 
complete cds. 


5e-092 


1216293 


(L35239) 
cardiac 
tropomyosin 
[Xenopus laevis]


2e-035 


308 


X85992 


M. musculus 
mRNA for 
semaphorin C 


8e-097 


2137756 


semaphorin C - 
mouse 
(fragment) 
musculus] 


2e-048 
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Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






309 


M24103 


Bovine 
ADP/ATP 
translocase T2 
mRNA, 
complete cds. 


e-103 


113463 


ADP,ATP 

CARRIER 

PROTEIN, 

LIVER 

ISOFORM T2 

(ADP/ATP 

TRANSLOCAS 

E3) 

(ADENINE 
NUCLEOTIDE 
TRANSLOCAT 
OR 3) (ANT 3) 
>gi| 86757 |pir 
| |S03894 
ADP,ATP
carrier protein 
T2 - human 


2e-035 


310 


U48852 


Cricetulus 
griseus HT 
protein mRNA, 
complete cds. 


e-107 


1216486 


(U48852) HT 
protein 
[Cricetulus 
griseus] 


3e-057 


311 


X76168 


R.norvegicus 
mRNA for 
connexin 30.3 


e-112 


544118 


GAP 

JUNCTION 
BETA-5 
PROTEIN 
(CONNEXIN 
30.3) (CX30.3) 
>gi| 481577| pi 
r| |S38891 
connexin 30.3 - 
rat 

>gi|431204|e 
mb|CAA53762 
| (X76168) 
connexin 30.3 


1e-063


312 


X76168 


R.norvegicus 
mRNA for 
connexin 30.3 


e-115 


461864 


GAP 

JUNCTION 
BETA-5 
PROTEIN 
junction protein 
Cx30.3 - mouse 
>gi |192647 
(M91443) 
connexin 30.3 
[Mus musculus] 


7e-064 
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Table 2 






Nearest 
Neighbor 
(BlastN vs. 
Genbank) 






Nearest Neighbor 
(BlastX vs. Non- 
Redundant 
Proteins) 






313 


AJ009634.1 


Mus musculus 
fjx1 gene


e-137 


4138203 


(AJ009634) 
Fjx1 [Mus
musculus] 


5e-065 


314 


X76168 


R.norvegicus 
mRNA for 
connexin 30.3 


e-130 


544118 


GAP 

JUNCTION 
BETA-5 
PROTEIN 
(CONNEXIN 
30.3) (CX30.3)
>gi| 481577| pi
r| |S38891 
connexin 30.3 - 
rat

>gi|431204|e
mb|CAA53762
| (X76168) 
connexin 30.3 


2e-074 
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PairAB-text 




_21,22 (Normal Prostate vs. Cancerous Prostate) 






_15,16 (Normal Colon vs. Colon Tumor) 


_16,17 (Colon Tumor vs. Colon Metastasis) 






_08,09 (Lung, High Metastatic Potential vs. Lung, Low Metastatic Potential)


_19,20 (Colon Tumor vs. Colon Tumor Metastasis)






_15,16 (Normal Colon vs. Colon Tumor)


_16,17 (Colon Tumor vs. Colon Metastasis) 






_23,24 (Normal Lung vs. Lung Tumor) 






_21,22 (Normal Prostate vs. Cancerous Prostate) 


_16,17 (Colon Tumor vs. Colon Metastasis)


_15,16 (Normal Colon vs. Colon Tumor) 
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372700 








729832 
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co 
c4 


3.89 






2.25 


















9.15 






7.12 






6.61 






6.62 






RATIO 
PLUS 
















7.87 






4.43 


























7.11 




CLONES 
inB 


oo 
VO 


T-H 






CN 
CO 






T-H 
























CO 

t-H 








o 




CLONES 
in A 


00 
CN 


oo 

CN 












00 






oo 






o 






o 






CN 






o 






PairAB-text 


_15,16 (Normal Colon vs. Colon Tumor)


_15,17 (Normal Colon vs. Colon Metastasis) 






_16,17 (Colon Tumor vs. Colon Metastasis) 






_21,22 (Normal Prostate vs. Cancerous Prostate)






_21,22 (Normal Prostate vs. Cancerous Prostate) 






_21,22 (Normal Prostate vs. Cancerous Prostate) 






_21,22 (Normal Prostate vs. Cancerous Prostate) 






_21,22 (Normal Prostate vs. Cancerous Prostate) 






_15,16 (Normal Colon vs. Colon Tumor) 


_16,17 (Colon Tumor vs. Colon Metastasis) 




CLUST 
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PairAB-text 




_16,17 (Colon Tumor vs. Colon Metastasis) 






_21,22 (Normal Prostate vs. Cancerous Prostate) 






_08,09 (Lung, High Metastatic Potential vs. Lung, Low Metastatic Potential)






_16,17 (Colon Tumor vs. Colon Metastasis) 


_15,16 (Normal Colon vs. Colon Tumor) 


_21,22 (Normal Prostate vs. Cancerous Prostate) 






_21,22 (Normal Prostate vs. Cancerous Prostate)






_21,22 (Normal Prostate vs. Cancerous Prostate) 






_03,04 (Breast, High Metastatic Potential vs. Breast, Non-Metastatic)






_21,22 (Normal Prostate vs. Cancerous Prostate) 
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PairAB-text 






_21,22 (Normal Prostate vs. Cancerous Prostate) 


_08,09 (Lung, High Metastatic Potential vs. Lung, Low Metastatic Potential) 






_21,22 (Normal Prostate vs. Cancerous Prostate) 






_21,22 (Normal Prostate vs. Cancerous Prostate) 






_21,22 (Normal Prostate vs. Cancerous Prostate) 






_21,22 (Normal Prostate vs. Cancerous Prostate) 






_21,22 (Normal Prostate vs. Cancerous Prostate) 






_21,22 (Normal Prostate vs. Cancerous Prostate) 






_21,22 (Normal Prostate vs. Cancerous Prostate) 


CLUST 


379879 








268290






818043 






450247 






819273 






587779






615617 






818682 






q 


T-H 

o 








5 

T-H 






oo 
o 

T-H 






T— 1 
T— t 






in 

T-H 
T-H 






T-H 






oo 

T-H 

X — 1 






CN 










RATIO 
MINUS 






























20.33 






5.59 


6.44 






11.18 






y — 1 


RATIO 
PLUS 






6.88 






6.88 






5.41 






6.88 




























CLONES 
inB 






o 






o 












o 






o 






T 1 


ON 






y—i 
y—i 








CLONES 
in A 


















1 — 1 

T—H 












T-H 






CN 


o 






o 






o 


PairAB-text 






_21,22 (Normal Prostate vs. Cancerous Prostate) 
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Regional 
Lymphnode
Grade


N0


IN 


N0


Incidence


Lymphnode
Met


o 

1—1 

o 


0/15 


0/12 


Lymphnode


Met 


negative 


negative 


negative 


Local Invasion 


Invasion 
through 
muscularis 
propria into 
subserosal 
adipose tissue. 


Invades 

through 
muscularis 
propria to 

involve 
pericolonic 

adipose, 
extends to 

serosa. 


Invades full 
thickness of 
muscularis 
propria, but 
mesenteric 
adipose free of 
malignancy 


Histopath 
Grade 


O 


ZO 


ZD 


Primary


Tumor 
Grade 


CO 

H 


T3 


T2 


Primary 
Tumor 
Size 






vd 


Anatomical


Loc 


Ascending 
colon 


Transverse 
colon 


Cecum 


Path


Report 
ID 


co 
oo 


oo 
<N 


oo 


Table 8 
Patient
ID


vo 


VD 

vo 


OO 

vo 
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Comment 


Descending 

colon 

polyps, no 

HGD or

carcinoma 

identified.


Melanosis 
coli and 
diverticular 
disease. 


1 

hyperplastic 

polyp 

identified 




Dist Met
Grade


M0


M0


M0


MX 


Descrip 
Distant 
Met 










Distant 
Met & Loc 


negative 


negative 


negative 


negative 


Regional 
Lymphnode
Grade


N2


N0


N0


N0


Incidence 
Lymphnode
Met


7/10 


0/12 


o 


o 


Lymphnode 
Met 


positive 


negative 


negative 


negative 


Local Invasion 


Invasion into 

perirectal 
adipose tissue. 


Invasion 
through 
muscularis 
propria into 
percolic 
adipose tissue. 


Extends into 
perirectal fat 
but does not 
reach serosa 


Invasion 
through 
muscularis 
propria to 
involve 
pericolonic fat. 
Arising from 
villous 
adenoma. 


Histopath 
Grade 


ZO 


s 


ZO 


G2 


Primary 
Tumor 
Grade 


CO 

H 


p 


P 


R 


Primary 
Tumor 

Size 




p 

in" 




2 cm 
invasive 


Anatomical 
Loc 


Rectum 


Ascending 
colon 


Rectosigmoid 


Ascending 
colon 


Path 
Report 


o\ 

CN 


CO 


oo 
m 

CO 


o 

CO 


Table 8 
Patient
ID


OO 

r-* 


wo 
ON 

CM 


o> 

CO 
CO 


CO 
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Comment 




Two 

mucosal 

polyps 


Tumor 

arising at 

prior 

ileocolic 

surgical 

anastomosis. 


Dist Met
Grade


M0


M0


M1


Descrip 
Distant 
Met 






Macrov 
esicular 
and 

microve 

sicular 

steatosis 


Distant 
Met & Loc 


negative 


negative 


positive
(Liver) 


Regional 
Lymphnode
Grade


N0


IN 


N1


Incidence 
Lymphnode
Met


o 


to 


vo 


Lymphnode 
Met 


negative 


positive 


positive 


Local Invasion 


Through colon 
wall into
subserosal 

adipose tissue. 
No serosal 
spread seen. 


Invasion thru 
muscularis 
propria to 

pericolonic fat 


Invasion 
through 
muscularis 
propria into 
subserosal 
adipose tissue, 
not serosa. 


Histopath 
Grade 


G2 


G2 


O 


Primary 
Tumor 
Grade 


B 


T3 


T3 


Primary 
Tumor 
Size 


*o 

VD 


CO 




Anatomical 
Loc 


Sigmoid 


Ascending 
colon 


Ascending 
colon 


Path 
Report 

ID


to 
cn 


CN 

i — i 
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PatientI 
D 


VO 
io 
co 


co 


(N 

o\ 
co 



143 



WO 01/72781 



PCT/US01/09952 



Comment 




rediagnosis 
of 

oophorectomy
path to
metastatic 
colon 
cancer. 


Dist Met
Grade


M0


M1


Descrip 
Distant 
Met 




adenocarcinoma
in
multiple
slides


Distant 
Met & Loc 


negative 


positive 
(Liver) 


Regional
Lymphnode
Grade


N0


N0


Incidence
Lymphnode
Met


0/21 


o 


Lymphnode 
Met 


negative 


negative 


Local Invasion 


Cecum, invades 
through 
muscularis 
propria to 
involve 
subserosal 
adipose tissue 
but not serosa. 


Invasive 
through 

muscularis to 
involve 

periserosal fat; 
abutting 
ileocecal 
junction. 


Histopath 
Grade 


ZD 


G2 


Primary 
Tumor 
Grade 


T3 


T3 


Primary 
Tumor 
Size 


o 

\6 


oo 


Anatomical 
Loc 


Cecum 


Ascending 
colon 


Path
Report
ID




uo 


Table 8 
Patient ID


CO 

o\ 

CO 


CO 

i — i 
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Comment 


Anatomical 
location of 
primary not 
notated in 
report, 
evidence of
chronic 
colitis. 


No mention 
of distant 
met in report 


Dist Met 
Grade 


M1


M0


Descrip 
Distant 
Met 


moderately
differentiated
adenocarcinoma,
consistent with
primary




Distant 
Met & Loc 


positive 
(Liver) 


negative 


Regional
Lymphnode
Grade


S5 




Incidence
Lymphnode
Met


2/17 




Lymphnode 
Met 


positive 


positive 


Local Invasion 


Invasion 
through 
muscularis 
propria 
involving 
pericolic 
adipose, serosal 
surface 
uninvolved 


penetrates 
muscularis 
propria, 
involves 
pericolonic fat. 


Histopath 
Grade 


G2 


G2 


Primary 
Tumor 
Grade 


T3 


T3 


Primary
Tumor
Size


7.5 cm 
max dim 


CO 


Anatomical 
Loc 




Sigmoid 


Path
Report
ID

M.XJ 


co 
oo 
co 


ON 
CO 


Table 8 
Patient ID


io 
o 

>0 


o- 
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Comment 


Omentum 
with fibrosis 
and fat 
necrosis. 
Small bowel 
with acute 
and chronic 
serositis, 
focal 

abscess and 
adhesions. 




Dist Met
Grade


M0


M1


Descrip 
Distant 
Met 




metastatic
adenocarcinoma


Distant 
Met & Loc 


negative 


positive 
(Liver) 


Regional
Lymphnode
Grade


N0




Incidence
Lymphnode
Met


oo 
o 


6/12 


Lymphnode 
Met 


negative 


positive 


Local Invasion 


Invasion 
through the 
muscularis 
propria 
involving 
pericolic fat. 
Serosa free of 
tumor. 


Invasion 

through 
muscularis 

propria 
extensively 

through 
submucosal and 
extending to 

serosa. 


Histopath 
Grade 


CO 

O 


ZD 


Primary 
Tumor 
Grade 


T3 


P 


Primary
Tumor
Size


i — i 


in 
in 


Anatomical 
Loc 


Ascending 
colon 


Ascending 
colon 


Path 
Report 
ID 


CO 

in 
m 


in 
vo 
m 


Table 8
Patient ID


CO 

m 


vo 
m 
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Comment 


Appendix 
dilated and 
fibrotic, but 
not involved 
by tumor 


tubular adenoma
and hyperplastic
polyps present,
moderately
differentiated
adenoma with
mucinous
differentiation
(% not stated)


invasive
poorly
differentiated
adenosquamous


Dist Met 
Grade 


M0


MX 




Descrip 
Distant 
Met 








Distant 
Met & Loc 


negative 


negative 


positive 
(Liver) 


Regional
Lymphnode
Grade


N0


N0


N2


Incidence
Lymphnode
Met


0/58 


0/22 


5/17 


Lymphnode 
Met 


negative 


negative 


positive 


Local Invasion 


Invasion 
through the 
bowel wall, 
into subserosal

adipose. 
Serosal surface 
free of tumor. 


extending 
through bowel 
wall into 
serosal fat 


through 
muscularis 
propria into 
pericolic soft 

tissues 


Histopath 
Grade 


ZD 


ZD 


8 


Primary 
Tumor 
Grade 


P 


T3 


P 


Primary 
Tumor 

Size 


11.5 




*o 

CO 


Anatomical 
Loc 


Cecum 


Cecum 


Ascending 
colon 


Path 
Report 


VO 
ON 

^o 


i — i 
r- 


CO 

oo 


Table 8 
Patient ID


io 


)Q 
os 

VO 


oo 
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Comment 


carcinoma 


moderately
differentiated
invasive
adenocarcinoma


poorly
differentiated
invasive
colonic
adenocarcinoma


Dist Met 
Grade 




1 I 

s 


M1


Descrip 
Distant 
Met 








Distant 
Met & Loc 




positive 
(Liver) 


positive 
(Liver) 


Regional
Lymphnode
Grade




N0


N2


Incidence
Lymphnode
Met




0/12 


13/25 


Lymphnode 
Met 




negative 


positive 


Local Invasion 




through 
muscularis 
propria into 
pericolic fat, 

but not at 
serosal surface 


through the 
muscularis 
propria into 
pericolic fat 


Histopath 
Grade 




ZD 


G3 


Primary 
Tumor 
Grade 




m 
H 


T3 


Primary 
Tumor 

Size 




in 


CO 

in 


Anatomical 
Loc 




Descending 
colon 


Ascending 
colon 


Path
Report
ID




m 
o 

oo 


o 
oo 


Table 8 
Patient ID




vo 
oo 


1— 1 
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Comment 


well- to
moderately-
differentiated
adenocarcinoma; this
patient has
tumors of
the ascending
colon and
the sigmoid
colon


moderately
differentiated
adenocarcinoma


Dist Met 
Grade 


M1




Descrip 
Distant 
Met 






Distant 
Met & Loc 


positive 
(Liver) 


positive 
(Liver) 


Regional
Lymphnode
Grade


N0


N1


Incidence
Lymphnode
Met


m 


I—* 


Lymphnode 
Met 


positive 


positive 


Local Invasion 


into muscularis 
propria 


through 
muscularis 
propria into
subserosal 
tissue 


Histopath 
Grade 


G1


ZD 


Primary 
Tumor 
Grade 


T2 




Primary 
Tumor 
Size 


o 


oo 


Anatomical 
Loc 


Ascending 
colon 


Cecum 


Path
Report
ID


oo 
o 

OS 


Os 

o 

OS 
Patient ID


oo 
oo 
oo 


OS 
oo 
oo 
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We Claim: 

1. An isolated polynucleotide comprising a nucleotide sequence which hybridizes under
stringent conditions to a sequence selected from the group consisting of SEQ ID NOS:1-316.

2. An isolated polynucleotide comprising at least 15 contiguous nucleotides of a nucleotide
sequence having at least 90% sequence identity to a sequence selected from the group consisting of:
SEQ ID NOS:1-316, a degenerate variant of SEQ ID NOS:1-316, an antisense of SEQ ID NOS:1-316,
and a complement of SEQ ID NOS:1-316.

3. A polynucleotide comprising a nucleotide sequence of an insert contained in a clone 
deposited as clone number xx of ATCC Deposit Number xx.

4. An isolated cDNA obtained by the process of amplification using a polynucleotide
comprising at least 15 contiguous nucleotides of a nucleotide sequence of a sequence selected from
the group consisting of SEQ ID NOS:1-316.

5. The isolated cDNA of claim 4, wherein amplification is by polymerase chain reaction
(PCR) amplification.

6. An isolated recombinant host cell containing the polynucleotide according to claims 1, 2,
3, or 4.

7. An isolated vector comprising the polynucleotide according to claims 1, 2, 3, or 4.

8. A method for producing a polypeptide, the method comprising the steps of:
culturing a recombinant host cell containing the polynucleotide according to claims 1, 2, 3,
or 4, said culturing being under conditions suitable for the expression of an encoded polypeptide;
recovering the polypeptide from the host cell culture.

9. An isolated polypeptide encoded by the polynucleotide according to claims 1, 2, 3, or 4. 

10. An antibody that specifically binds the polypeptide of claim 9. 

11. A method of detecting differentially expressed genes correlated with a cancerous state of
a mammalian cell, the method comprising the step of:
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detecting at least one differentially expressed gene product in a test sample derived from a 
cell suspected of being cancerous, where the gene product is encoded by a gene comprising an 
identifying sequence of at least one of SEQ ID NOS: 1-316; 

wherein detection of the differentially expressed gene product is correlated with a cancerous 
state of the cell from which the test sample was derived. 

12. A library of polynucleotides, wherein at least one of the polynucleotides comprises the 
sequence information of the polynucleotide according to claims 1, 2, 3, or 4.

13. The library of claim 12, wherein the library is provided on a nucleic acid array. 

14. The library of claim 12, wherein the library is provided in a computer-readable format. 

15. A method of inhibiting tumor growth by modulating expression of a gene product, the 
gene product being encoded by a gene identified by a sequence selected from the group consisting of 
SEQ ID NOS:1-316.
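
Claim 2 refers to complements, antisense sequences, and runs of at least 15 contiguous nucleotides. The following is a minimal sketch of how those terms could be computed for a lowercase sequence such as those in the Sequence Listing. The function names are hypothetical and are not part of the claims, "antisense" is assumed here to mean the reverse complement, and the contiguous-run check uses exact matching only, which is a simplification of the 90% identity language in claim 2.

```python
# Illustrative helpers only; all names here are hypothetical.
# The ambiguity code "n" is mapped to itself.
_COMPLEMENT = str.maketrans("acgtnACGTN", "tgcanTGCAN")


def complement(seq: str) -> str:
    """Base-by-base complement, in the same orientation as the input."""
    return seq.translate(_COMPLEMENT)


def reverse_complement(seq: str) -> str:
    """Complementary strand read in the opposite direction."""
    return complement(seq)[::-1]


def shares_contiguous_run(query: str, reference: str, k: int = 15) -> bool:
    """True if any k contiguous nucleotides of `reference` occur exactly in `query`."""
    query, reference = query.lower(), reference.lower()
    return any(
        reference[i:i + k] in query
        for i in range(len(reference) - k + 1)
    )
```

A reference shorter than `k` never matches, because there is no window of the required length to compare.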
[Figures 1A-1B: comparison of SEQ ID NO:315 and clone H72034 (SEQ ID NO:317).]

[Figure 2: comparison of SEQ ID NO:316 and clone AA707002 (SEQ ID NO:318).]
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SEQUENCE LISTING 

<110> Williams, Lewis T. 
Escobedo, Jaime 
Innis, Michael A. 
Garcia, Pablo Dominiguez 
Sudduth-Klinger, Julie 
Reinhard, Christoph 
He, Zhijun 
Randazzo, Filippo 
Kennedy, Giulia C. 
Pot, David 
Kassam, Altaf 
Lamson, George
Drmanac, Radoje 
Crkvenjakov, Radomir 
Dickson, Mark 
Drmanac, Snezana 
Labat, Ivan 
Leshkowitz, Dena 
Kita, David 
Garcia, Veronica 
Jones, Lee William 
Stache-Crain, Birgit 

<120> Human Genes and Gene Expression Products 
XVI 

<130> PP-01625.001PC 

<150> 60/192,583 
<151> 2000-03-28 

<160> 324 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 214 

<212> DNA 

<213> Homo sapiens 

<400> 1 

ttagtactgc atatgtaaat actacctttt caatgagcta tataaacaat gatagcacat 60 

ccttcctttt actatgtctc acctccttta ggagagaact tccttaagta agtgctaaac 120 

atacatatac ggaacttgaa agctttggtt agccttgcct taggtaatca gactagttta 180 

cactgtttcc agggagtagt tgaattacta taag 214 
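
Each record in this listing uses numeric identifiers: `<210>` for the SEQ ID number, `<211>` for the length, `<212>` for the molecule type, `<213>` for the organism, and `<400>` for the residues, which are printed in groups of ten with a running position count at the end of each row. As a minimal sketch, assuming one record is held in a single string, the declared length can be checked against the residues actually present. The function name and the returned keys are hypothetical.

```python
import re


def parse_record(text: str) -> dict:
    """Parse one <210>...<400> record and compare <211> with the residue count.

    Assumes `text` holds exactly one record and that residues are lowercase.
    """
    fields = dict(re.findall(r"<(210|211|212|213)>\s*([^\n<]+)", text))
    # Everything after the <400> line is sequence rows; digits are position counters.
    _, _, body = text.partition("<400>")
    _, _, rows = body.partition("\n")
    residues = "".join(re.findall(r"[a-z]+", rows))
    declared = int(fields["211"])
    return {
        "seq_id": int(fields["210"]),
        "declared_length": declared,
        "molecule": fields["212"].strip(),
        "organism": fields["213"].strip(),
        "residues": residues,
        "length_ok": len(residues) == declared,
    }
```

A `length_ok` value of `False` would flag a record whose rows were damaged or truncated in transcription.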

<210> 2 

<211> 353 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(353)
<223> n = A,T,C or G

<400> 2

ggcacgagga gagaactaga aaatatgtat attggatata ctatgtgcca ggcacgattc 60

caagcccctg atacattctc tannnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 120

nnnnnnnnnn nnnnnnnnnn nnaaagtgga acactgggat ttgaacaagg ttttggttgg 180 

gcatcttttc ctatgggagc tcagaaatat ctgttgtcta gccctttctc agcctcccaa 240 
ccttctcggt tccttaccta tgtcacagct gactttgagc taaagtcatc tcggggcagc 300
taggtgccta tgtgagctgg cgttcatttc tcactgtttc tccttccaaa tac 353

<210> 3
<211> 399
<212> DNA
<213> Homo sapiens

<400> 3
ggcacgagcc caccaagagc tgcatagagc acgtttagct agagtaggag tttgcagtgc 60
tcatatggga aatgctgctg ctatactttt aggaatttct gagtgcaatt tagaaacatc 120
tagcacactt gaaacactgc gtatcatttt cctcactcat gaatatagtc atcagaattc 180
ataaatagtt tacctgagcc ctttaacaac ctcaaatagg ccatatttct ctctctggtt 240
gatggcatgg accctacagg aaaaaccaca ccttaccgct tctgaccagc atcactacaa 300
aaaggagtgc tgaagccaat caccatgtaa gcaagataaa agcaaagggg gtcttgcctg 360
cccatctctg ttccatacat tcttaccagg cactgagag 399



<210> 4 

<211> 389 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature
<222> (1)...(389)
<223> n = A,T,C or G

<400> 4

ggcacgagga gagggtggtg ggtccctgag ttggtggaaa gggatagagn nnnnnnnnnn 60

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 120

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 180

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngagtagag gatgcctggt atgaggcaat 300

atttgggata gggaagggaa gcttgggatt ttagctacgt agagacactt gaaaattgga 360 

gggaggaaag gagtgggtgg ctttggagn 389 
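
Several records carry a `misc_feature` note stating that `n = A,T,C or G`, and the masked stretches can be long. As a small sketch, the positions of those stretches could be located as follows. The function name is hypothetical, and positions are reported 1-based and inclusive to match the numbering style of the `<222>` location lines.

```python
import re


def n_runs(seq: str) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) positions of each run of 'n'."""
    return [(m.start() + 1, m.end()) for m in re.finditer(r"n+", seq.lower())]
```

Spaces and position counters should be stripped from the rows before calling the function, since they would otherwise shift the reported positions.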



<210> 5 

<211> 279 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(279)
<223> n = A,T,C or G

<400> 5

cctctcccct tggaacccaa agaggaacgg ggccgaactt tataaacttt aggcaagggc 60

aaagggcgtg nnnnnnnnnn nnggggccaa ggggcatttc ccaagcgatt aaaatttggg 120

aacctttggt tacaaaaatt gcggggaaaa tttatttcgg gagcaatttt ccctttaaaa 180 

atttgagaat tcttacccgg agagtgtgac ataatttaag gcgcctctgc ccaaagaggc 240 

catgtgcgtg aggggaatac cgcgtttaat tatcacaaa 279 



<210> 6 

<211> 388 

<212> DNA 

<213> Homo sapiens 



<400> 6 

ggcacgaggc agaggcctcc ctgcactggt cctggcctca ctcttttccc tgacccttgg 60 

ggcccagggc catggaggga cccttaggag ttcaatgaga gagaccatga ggccactggg 120

ctttcccctt cccaggcctc ctgggtgcca cccccttacg ttattcttgg gcctctaata 180

agtgtcccac aggtgcctgg ccaggcccac ctgctgcaga tgtggtctgt gtgtgtgcat 240 
gtgtgggtgt gtgtgggcac aggtgtgagt gtgtgagcaa cagtacccca ttccagtcgt 300

ttcctgctgt gactaagtca gcaacacagt tcctctgaca tgggccttgg ctgtgcttct 360 
ttgggggtga agagattgcg gaggaagt 388



<210> 7
<211> 410
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> (1)...(410)
<223> n = A,T,C or G

<400> 7
ggcacgaggg gaagtcgcgc atgcgcgagt gtacgcgttg ccggcgaaga ggggagcctg 60
acgactcgga aatttgaata ccacagtagc atggagtgtg acctcatgga gactgacatc 120
ttggagtcgt tggaagatct aggttacaag ggcccattgt tggaagatgg agcgctctct 180
caggcagtct ctgctggagc cagttccccc gagtttacca aactctgtgc ttggctggtg 240
tctgaattaa gagtgctctg taaactagag gaaaacgtgc aagcaactaa cagtccgagt 300
gaagctgaag aattccagct tgaggtgagt gggctactag gggagatgaa ctgcccgtat 360
ctttcactga catctgnnga tgtgaccaag cgccttctca ttcagaaaaa 410

<210> 8
<211> 229
<212> DNA
<213> Homo sapiens

<400> 8
ctaacaaaaa acactaaaaa aaaataaaag aaattaattg aaactgacct aactcgtggc 60
agggggaact cggctataag acccacaaac cctgctgact cataacaaac tgagttgtaa 120
gacattcatc gccgcgatat ccttgagtaa agaatgaact ctggaagccc acccacggac 180
aatgcacctt cacaaagatt ctgcactaat ctgagtgaag gtctttggt 229

<210> 9
<211> 380
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> (1)...(380)
<223> n = A,T,C or G

<400> 9
ggcacgagag tagttgggaa atcttttata aatccaccta ttactaccta ttggtagggg 60
agattaaatt tctacaggta tggagagtcg gcttgactac actgtgtgga gcaagtttta 120
aagaagcaaa ggtatagcag ttccaagtan nnnnnnnnnn nnnnnagacc aaactctaga 180
tcttgcccaa aatggacggc cgcggcattt aaatgaagaa agatttattt ttcctttttt 240
cttttaagaa aaattttttt aaaaaatttt gattnnnnnn nnnnnnnnnn nnnnnnnnnn 300
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360
nnnnnnnnnn nnnnnnnnnn 380



<210> 10 

<211> 317 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(317)
<223> n = A,T,C or G
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<400> 10 

cacacacaca catccactct ctctttttgc tctcttctca cacacacata tctcccttac 60 

tcacacactc tctctcacac accccctctt tcttttcccc cgcactttct ttctctcacg 120 

cgcgcgcgca ctcactctct tttttcttct ctctctcact ctctctctcc gcgcgctctc 180

tcacacgctt tatatctctc tctctgaggg acttctctct cctctcactc ttattttttt 240

gttgtgtttt atagcgtctc tctcttccct nnnnnnnnnn ntctatatat acagagagag 300 

atctctctgc tctctcc 317 



<210> 11 

<211> 391 

<212> DNA 

<213> Homo sapiens 



<400> 11 

ggcacgagag aattagctga aacccaccaa gagctgcata gagcacgttt agctagagta 60 

ggagtttgca gtgctcatat gggaaatgct gctgctatac ttttaggaat ttctgagtgc 120

aatttagaaa catctagcac acttgaaaca ctgcgtatca ttttcctcac tcatgaatat 180

agtcatcaga attcataaat agtttacctg agccctttaa caacctcaaa taggccatat 240 

ttctctctct ggttgatggc atggacccta caggaaaaac cacaccttac cgcttctgac 300 

cagcatcact acaaaaagga gtgctgaagc caatcaccat gtaagcaaga taaaagcaaa 360 

gggggtcttg cctgcccatc tctgttccat a 391 



<210> 12 

<211> 280 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(280)
<223> n = A,T,C or G



<400> 12 

tgtgcgcgcc cccccggggc gctctctctc tacactcgtg cgctcccccc tctgtctgtc 60 

tctctctcta gagtcacggt ctcctacacg gcgcgcacat gcgaggggca ctnnnnnnnn 120 

nnngctcnnn nnnnnnnnnn nnnnnnnnnn cgnnnnnnnn nnnnnnntcc cttgtatact 180

ctctgtgtgc gcggggacan nnnnnnnnnn nngtgcgcgc gcgagagcgc gcgcgccaca 240

caagagagag cgcgccctnn nnnnnnnnnn naccgcgaac 280



<210> 13
<211> 311
<212> DNA
<213> Homo sapiens

<400> 13
cgcttttttg ggaacccaaa ccttttttgg ctggccggaa aaaatttcca cgggaagggt 60
aaaggggttt attaattttt ttggcaaaac aggggttaag aaaccttccc tcccggccta 120
agggtgggct aggctttgga aaggctaaaa gggggaaatt tctggccctt gttccaaggg 180
aaacatgggc tagggggaaa ccccacccct tcagggccct ttaaaagggc ccccaaaaaa 240
agaacccctt tattaagggt taaaaaaggt taaaaaaggt gggaacctca tgggccaagg 300
caaatttttg t 311

<210> 14
<211> 387
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> (1)...(387)
<223> n = A,T,C or G

<400> 14
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ggcacgaggt cttttctgcc cacatctcac acaattgagg tgtctgaaca agcttgggga 60 

gggtctataa ggggtaggct cnnnnnnnnn nnncccattt ggaaagggcg ttttgccaac 120 

ccaagggctt ttttaagccg atttttnnnn nnnnnnccgg acttggtaat tggcttttgg 180

ctttttaaag cccaaaaaat aataattaag gggcccaaaa taaggaaggg caaaaaaagc 240

ctttactccc cctgcctttc aaaaagaaaa ggaaaaaccg gccccccctt aataattggc 300

acccctaaaa aaaggggttt taaaaaaagc caaaaacaaa agggcctgga aaaaaatttt 360 

gacttttttt aacccggaac ctgggaa 387 



<210> 15 

<211> 273 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(273)
<223> n = A,T,C or G



<400> 15 

ctgtctctct ctctcccccc ctctccctcc cgcgcgcgca cgctctttca tctctctctc 60 

tacagacagg ggggggtgtt ctctctccct ctcgagaggg accgcttttt ttttctcccc 120

ctctctcaca ctcggggtgt gcgcgctccc tttgggggct tttctatagg gcgcgctcta 180

aagaaagccc gcctttctcc tctgggtgcc tcctcccaca cccgggtttt ctcccccgct 240 

gtttttgaag aaactcctcc tggtctcctt atn 273 



<210> 16 

<211> 283 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(283)
<223> n = A,T,C or G



<400> 16 

ctctctctct ggcccccccc ctctctttac acacactttc tctcctctct ctcgctctct 60 

cttttttttt ctctctcccc tcgctctctc tgtgtgtctc tatctcgtgt ctctctctgc 120

gtgtccctca cacacactcg cgcgagagat ctctctctat atctctcctt tgtctgtgtc 180 

tctctctcgc gcgcccacac atctatatat ttttgcgcgc acacgcgaga gtgtgtccct 240 

ctctctctct gcacnnnnnn nnnnnnnnnn cacaccctcc ccc 283



<210> 17 

<211> 392 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(392)
<223> n = A,T,C or G



<400> 17 

ggcacgagat gactccttcc ttaaaatcca gctcaaatct cccccctttt ggtggctttc 60 

tctgacactc catcataaag ctaattgttt aagtatgatc cagtggcaca gtttattcct 120 

acttcataac ttttatctca ctatgttgta agatattagg tatgtttctt ctactaccag 180

taattttcaa agagttaagg aagaaggata gaagacagca gtataggtga atgtgtgcat 240 

gtgttnnnnn nnnnnnnngc catattggcc aaaatttttg gactggctgg taaaacaaag 300 

gcttttcaaa ttttcaaata cctttaaaaa aaacctggaa attgttttgt nnnnnnnnnn 360 

cgcccaaaaa aaaattttgg gcctgggggg ga 392 



<210> 18
<211> 385
<212> DNA
<213> Homo sapiens

<400> 18
ggcacgaggc agaggcctcc ctgcactggt cctggcctca ctcttttccc tgacccttgg 60
ggcccagggc catggaggga cccttaggag ttcaatgaga gagaccatga ggccactggg 120
ctttcccctt cccaggcctc ctgggtgcca cccccttacg ttattcttgg gcctctaata 180
agtgtcccac aggtgcctgg ccaggcccac ctgctgcaga tgtggtctgt gtgtgtgcat 240
gtgtgggtgt gtgtgggcac aggcgtgagt gtgtgagcaa cagtacccca ttccagtcgt 300
ttcctgctgt gactaagtca gcaacacagt tcctctgaca tgggccttgg ctgtgcttct 360
ttgggggtga agagattgct gaggc 385

<210> 19
<211> 383
<212> DNA
<213> Homo sapiens

<400> 19
gaaggcttgc ggagagaaaa ccctggagcc atcttcatag gaagaggaaa ggaaactgta 60
tgacaggaga atgaatcaag tttggggctc aaggtgccgg ccactgggaa aaacagctgc 120
cccgagttgc aaaactctgg gtcctatatg tataaactat gccctgagga aggaatctca 180
ggcgtatctt aggagaaaat gttctagctt gggaaacaaa cacaacagga ccgtgaatcc 240
aaatatttca agtgggttta gaggactgga gttctaaacg ctgcttttac tgtaagtgat 300
cacgccccgg aatgtgctga agaaaggaaa atgagccagt atcggcgagg actatgggca 360
aggaaaacga gagtgtgcga tgt 383

<210> 20
<211> 313
<212> DNA
<213> Homo sapiens

<400> 20
ctctcccccg cgctcttgag atatgcgcgc cccttttttc ttctacacgg gggggggcgc 60
gcctcttttt ctcgcgcgcc cccctctctc tcttttgtgc gcacgcgcgc gcgcgggggg 120
gttctttttt tgtgcggaga gagagtctgt ctcaggggtt tttttgtttt ctttcacgac 180
acacactttc tcccctgtgc atgtgttttg atgctctctc gagatatgtc tctctctctc 240
tgtgtgtgtg tgttgtgcgc cccccctggg gagagcgctc ttctctctct cctcatatag 300
cgcgcgcgcg cga 313

<210> 21
<211> 396
<212> DNA
<213> Homo sapiens

<400> 21
ggcacgaggg gacccccttc acctctgtct agagagctgg gtagatcaga aacttggtga 60
cacctggcta gcacagagca ggctcacttg tcttggtccc actacccaga ttcctgcaga 120
cattgcaaac caaatgaagg ttgttgaatg acccctgtcc ccagccactt gttttgttat 180
catctgctct gcagtggaat gcctgtgtgt ttgagttcac tctgcatctg tatatttgag 240
tatagaaacc gagtcaagtg atcatgtgca tccagacaca ctgtgtcacc tgagccacag 300
agcaaatcac cttaacgatc tggaatgaaa ctgtgaccag tgccgccctg ggtggttctg 360
gagagactgc cgtcttcttg tttggccata ggtgcg 396

<210> 22
<211> 310
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> (1)...(310)
<223> n = A,T,C or G

<400> 22
tgaatatcag ggccctgaac atctctcacg cccgtcttct aaaagagaag aaaaaaacgc 60
gcgcgggctt tttctctctc tcagaggggt gaaacacaca atatctcggg gggccggggg 120
agagcccgct ctctctgcct gtaaaacaca cagaagtgcg ctcacgccct gcgcgggagc 180
ccacagactt ttttttaaaa caaaaagtat attggggtgt gttttaatct ccctctccgc 240
tcctagaggg ggggcgnnnn nnnnnnnnnn ntttttaaat aggggggccc gagtctcacc 300
caatagaagg 310



<210> 23
<211> 375
<212> DNA
<213> Homo sapiens

<400> 23
ggcacgagcc ggcgaagagg ggagcctgac gactcggaaa tttgaatacc acagtagcat 60
ggagtgtgac ctcatggaga ctgacatctt ggagtcgttg gaagatctag gttacaaggg 120
cccattgttg gaagatggag cgctctctca ggcagtctct gctggagcca gttcccccga 180
gtttaccaaa ctctgtgctt ggctggtgtc tgaattaaga gtgctctgta aactagagga 240
aaacgtgcaa gcaactaaca gtccgagtga agctgaagaa ttccagcttg aggtgagtgg 300
gctactaggg gagatgaact gcccgtatct ttcactgaca tctggggatg tgaccaagcg 360
ccttctcatt cagaa 375



<210> 24 

<211> 477 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature
<222> (1)...(477)
<223> n = A,T,C or G



<400> 24 

gctccttctt cttnttgttg atcccatcga tccgaattcg gcacgagagc acctctgtgc 60 

ctctctgaga gcactcacag ccaaaagtac acagctgccc ccaggctgag agtgcttgat 120

acacccttga atcccctctt atatgatgcc ccagcccagg agagataaaa gcatcagcac 180

catgagattc acctgcctct ggtcgtnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnact cttagacagc aaaaatgctt 360

tctcccagtc ttgttccctt gttctcagtt cccaccctgc ctggataact actgttcttg 420 

gtttnnnnnn nnnnnnnnnn nnnnnnnnag tctcgtacca gattcataaa tcagccg 477 



<210> 25 

<211> 265 

<212> DNA 

<213> Homo sapiens 



<400> 25 

cgcgcggggg ggacccctct ctctctctct gttgcgcgcg ctctctcacc ccgtgtgtcg 60 

cccccgatat tgtcagagag accccctatt tttttctccc gccccacaca catctatgtg 120 

taaaatgtgc gtgtctgtcg cgcacaccca cacactctcc ccggggggtt tataaaatac 180 

tcgcgcgcta tattttcgcc cccctttttg tgtgtgggcg ccacaaaaac accacacgct 240 

ctcccccctg tctctcgcgg gtgtt 265 



<210> 26 

<211> 388 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1)...(388)
<223> n = A,T,C or G 
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<400> 26 

ggcacgaggg aggctctttg ttatagatgc ttttgccccc ttaatacagc aatgagagca 60
ctgaccgaag aggcagccgt gactgtaaca cctccaatca cagcccagca agctgacaac 120
atagaaggac ccatagcctt gaagttctca cacctttgcc tggaagatca taacagttac 180
tgcatcaacg gtgcttgtgc attccaccat gagctagaga aagccatctg caggtgtcta 240
aaattgaaat cgccttacaa tgtctgttct ggagaaagac gaccactgtg aggcctttgt 300
gaagaatttt catcaaggca tctgtagaga tcagtgagcc caaaattaaa gttttcagat 360
gaaacaacaa aacttgtcaa gctgactn 388

<210> 27
<211> 431
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> (1)...(431)
<223> n = A,T,C or G

<400> 27
ggcacgagag aggggctact ttagatgcaa aggggacaat tagaaggcta ctgaggtaat 60
ccggacaaaa agttgtaaat aaatcacggt ggcagtatgg tgaatagtgg aaggggtgta 120
tttgaagaaa ctggggaggc cgtgggagag gctggctagt gagaaatggg ccgaaggtga 180
aagcagctta ggggctggtt tccagttttc tggcactgca gactgggtag tgggaggtgg 240
ctttctcaag aggagaggtg agtgggaagg agcagggctg caggggaggt catggtcttg 300
ggagtggtgc tcagtctgac ttgcacatag gggagattat tttagatttc cgcaagaaaa 360
tgtccagcat gtagtcatat caatgnnnnn nnnnnnnnnn nnnnnnnnnn nntgagattt 420
acccaaaaag 431

<210> 28
<211> 389
<212> DNA
<213> Homo sapiens

<400> 28
ggcacgagcc acccccaaga gtgtggccat ctggggccgt gtggtatttg ccactcagga 60
gacatgtccc tatgacatag cagtggtgag cctggaggag gacctggatg atgtccccat 120
ccctgtgccc gctgagcact tccatgaagg cgaggctgtg agtgtggtgg gctttggcgt 180
ctttggccag tcttgcgggc cctcggtgac ctcaggcatc ctttccgctg tggtgcaggt 240
gaatggcacg cccgtaatgc tgcagaccac gtgtgctgtg cacagcggct ccagtggggg 300
acccctcttc tccaaccact caggaaacct ccttggcata atcaccagca acacccggga 360
caataatacg ggggccacct acccccacc 389

<210> 29
<211> 431
<212> DNA
<213> Homo sapiens

<400> 29
ggacgaggct ccagcgcact tttccaacac atcactgcat tatttgaatg caccatggca 60
gctattgtca ccttacttgg gagtgatcca gttggagctc tttatattcg gacatgtcga 120
gtattgatgc tttctgactg ggacacgatg ctttacaacc caaggccaga ttacggtacc 180
acagtgcact gtactcatga agccggctac ccactatata ccatcgtatt tatctattac 240
gcattctgct tggtattaat gatgctgctc cgacctcttc tggtgaagaa gaatgcatgt 300
gggttaggga aatctgatcg atataaaagt atttatgctg cactttactt cttcccaaat 360
gtaaccgtgc ttcaggcagc tgtgggaggc cttttatatt acgccttccc atacattata 420
ttagtggtat 431

<210> 30
<211> 393
<212> DNA
<213> Homo sapiens

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<400> 30 

ggcacgagac tacacccgct tcgatgactg gtacctgtgg gttcagatgt acaaggggac 60 

tgtgtccatg ccagtcttcc agtccttgga ggcctactgg cctggtcttc agagcctcat 120

tggagacatt gacaatgcca tgaggacctt cctcaactac tacactgtat ggaagcagtt 180 

tggggggctc ccggaattct acaacattcc tcagggatac acagtggaga agcgagaggg 240 

ctacccactt cggccagaac ttattgaaag cgcaatgtac ctctaccgtg ccacggggga 300

tcccaccctc ctagaactcg gaagagatgc tgtggaatcc attgaaaaaa tcagcaaggt 360 

ggagtgcgga tttgcaacaa tcaaagatct gcg 393 



<210> 31 

<211> 459 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(459)
<223> n = A,T,C or G



<400> 31 

gcaatcgcat tgtctttttg aggatnnnat naatgtcaat tcggcacgag ctttgtggat 60 

gtttccagct gccatcgtca cccttctgtc tgctccctgg accagcttca ggacttgaag 120 

gccctcgtgg ctgagatcat cacacatttg caggggctgc agagggactt atctctagca 180 

gtctcctaca gcaggctcca ttcctcagac tggaatctgt gtactgtatt tgggatcctc 240 

ctgggctatc ctgttcccta tacctttcac ctgaaccagg gagatgacaa ctgcttagct 300 

ctgactccac tacgagtatt cactgcccgg atctcatggt tgctaggtca acccccaatc 360 

ctgctctatt cttttagtgt cccagagagt ttgttcccac gcctgaggga cattctgaac 420 

acctgggaga aagacctcag aacccgattt atgactcac 459 



<210> 32 

<211> 445 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(445)
<223> n = A,T,C or G



<400> 32 

ggcacgagat ggagagcacc tctgtgcctc tctgagagca ctcacagcca aaagtacaca 60 

gctgccccca ggctgagagt gcttgataca cccttgaatc ccctcttata tgatgcccca 120 

gcccaggaga gataaaagca tcagcaccat gagattcacc tgcctctggt cgtnnnnnnn 180

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300 

nnnnactctt agacagcaaa aatgctnttc tccagtcttt gttccttggt ctcaaggtcc 360 

acccttgctg gataactact ggtcttgggt tcctggggta aagatggaac ttgagtaagc 420

tcgacccaaa tccaaaatca atccg 445 



<210> 33 

<211> 429 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(429)
<223> n = A,T,C or G



<400> 33 

ggcacgagcg cctgccctgc atcagggaga catgtcagct gaggagtaat tgaccagatt 60 

tctgctttag aaatatggca gtggaggcag gagatggcat ctgaggccca ggctggggag 120

aagggtgctg ggatgagaac ctggagttca gaccagggaa gggatgagag cctaagaaga 180
ggagctctca ccctgagaca ggctggtgca ggagtctgct cgatccaggc ctgggtccct 240

ggttccctct gagcttggga ggactatgtg agacagaaca ggaccagggg cctgcattcc 300

cccttgtatt attcatcnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnna 420 

tgatggccc 429 



<210> 34 

<211> 439 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(439)
<223> n = A,T,C or G



<400> 34 

gttctgtggg aatagagggt ccctggtgac agggcagggc tagatctgga gcctgcactt 60 

ggcctgtgac atactgtctt gtttctgaga atcctcccct acttctctag ttaatctcca 120

gagacttctg tgactactta atcacaaagg aaattttcag gaatattatc aaatactatt 180 

ttagaaaaaa aaagagaagg gatttgaatg ttttcagttc agtttagnta tcnnnnnnnn 240

nnnnnnnccc caaacttcaa aatggaggcc cccccctcct ttaacccccc taaaaaaaat 300 

tctgatgttt gaggtttggt tgccaattaa ccaaaccccc aaaaaaaaag ggggttaaac 360 

cccattggaa agttttccta attttggggg gtgccctttg aggtggaccc ggttccctgc 420

cctgggaaag gccccaaag 439 



<210> 35 

<211> 440 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(440)
<223> n = A,T,C or G



<400> 35 

ggcacgaggt gaagtcctgg ttccagactc ccctttttgc cgggacatga tggatctgtc 60 

agctggtgcc tatagtccta gagagctaga gatggaggga aattcagatc atctaaaccc 120 

ttcagccctt cactggacag aagaggaaac tgaggctcca tctgcatgac gttcccagag 180 

tcacggcaca aattcatgga agaagcagca ggaaactcag ttctccagtc tgggtccaat 240 

gtgtgtttta gaaatatctc cacagggtta atgactcaat ttttcatgca tgattgctag 300 

taatgacaat catgttatgt ttgtttctgt agctttggaa atcactcctt ccacttgagt 360 

ttcaggtccc aactgtccac acctgcagga gtgaggtttt gctgagactg ataaggcact 420 

cacattntgt gggagttgaa 440 



<210> 36 

<211> 423 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1)...(423)
<223> n = A,T,C or G 



<400> 36 

acgagcgcnn nncctgcatc agggagacat gtcagctgag gagtaattga ccagatttct 60 

gctttagaaa tatggcagtg gaggcaggag atggcatctg aggcccaggc tggggagaag 120 

ggtgctggga tgagaacctg gagttcagac cagggaaggg atgagagcct aagaagagga 180

gctctcaccc tgagacaggc tggtgcagga gtctgctcga tccaggcctg ggtccctggt 240 

tccctctgag cttgggagga ctatgtgaga cagaacagga ccaggggcct gcattccccc 300 

ttgtattatt catcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnatga 420

tgg 423

<210> 37 

<211> 424 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(424)
<223> n = A,T,C or G 



<400> 37 

ggcttgtaga nctcggaggt tngcaagaat cgcattcggc acgagctggg acacagtgng 60
ctctcttata tttgttgctg gaataaatga atgaactaag gcagtcttgt agggatttac 120
tgttaaccac catgggaaaa ttaaataaat gcggggaagg aaaacgttct aaaattagaa 180
gactactttc tactctcagc ttctgattcc ctctgagcta agaaccagac agccttaggc 240
tggtaactcc tataagctgg tcctcctccc atgctgaccc catctttact gtacaattca 300
cttttcatgg actgaaggca ccaccaagat agatccagga gtgacaactc cagtgtaggt 360
gtccactgtt cccttaatct ctgtcctgct ccaagtataa ataaatcggg gccatttcct 420
taga 424



<210> 38 

<211> 434 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1)...(434)
<223> n = A,T,C or G 

<400> 38 

ggcacgaggt acacagctgc ccccaggctg agagtgcttg atacaccctt gaatcccctc 60
ttatatgatg ccccagccca ggagagataa aagcatcagc accatgagat tcacctgcct 120
ctggtcgtnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 180
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240
nnnnnnnnnn nnnnnnnnna ctcttagaca gcaaaaatgc tttctcccag tcttgttccc 300
ttgttctcag ttcccaccct gcctggataa ctactgttct tggtttnnnn nnnnnnnnnn 360
nnnnnnnnnn agtctcgtac cagattcaaa aatcagtcaa ctacttcaaa aacaatgaca 420
tgctggctac ttaa 434

<210> 39

<211> 428

<212> DNA

<213> Homo sapiens



<400> 39 

ggcacgagct ttgtggatgt ttccagctgc cagcgtcacc cttctgtctg ctccctggac 60
cagcttcagg acttgaaggc cctcgtggct gagatcatca cacatttgca ggggctgcag 120
agggacttat ctctagcagt ctcctacagc aggctccatt cctcagactg gaatctgtgt 180
actgtatttg ggatcctcct gggctatcct gttccctata cctttcacct gaaccaggga 240
gatgacaact gcttagctct gactccacta cgagtattca ctgcccggat ctcatggttg 300
ctaggtcaac ccccaatcct gctctattct tttagtgtcc cagagagttt gttcccaggc 360
ctgagggaca ttctaaacac ctgggagaag gacctcagaa cccgatttag gactcagaat 420
gactttgc 428



<210> 40 

<211> 429 

<212> DNA 

<213> Homo sapiens 
<220>

<221> misc_feature
<222> (1)...(429)
<223> n = A,T,C or G 



<400> 40 

ggcacgagtg gagagcacct ctgtgcctct ctgagagcac tcacagccaa aagtacacag 60
ctgcccccag gctgagagtg cttgatacac ccttgaatcc cctcttatat gatgccccag 120
cccaggagag ataaaagcat cagcaccatg agattcacct gcctctggtc gttagggaac 180
aatggaggcc tgcgatttgg agttaaactc tcagtgatct ctgtgttgac aacaccaaag 240
ctagaggaat ccagtaggat gtgggcatgg ttttcccgga aggctgactg agcagttctg 300
caaatgtttg caagtacagg gcagaatttc atccagcctc agaaccttga gccaagactc 360
agcatcagca aagccaaaag tttcattttc ttgactgtgg gagtgctagt cccaaccttt 420
agatggccn 429



<210> 41 

<211> 430 

<212> DNA 

<213> Homo sapiens 



<400> 41 

actctgcaaa cagctacttg tgctgattgc aggagaccca taaattcgaa cgaggaacaa 60
ccgagacctg aaggggctga cgaacgcgat ttctgataag tatggggtcc ctgaagagaa 120
catttaccaa gcctacaata aatgcacgcg aggaatctta tgcaacatgg acaacaacat 180
cattcagcat tacagcaacc acgtcgcctt cctgctggac atggcggagc tggacggcaa 240
aattcagatc atccttaagg agctggaagg cctctcgagc atacaaaccc tcacgacctg 300
catggggcca gcagggacgt ggccccacgc cacacacaac ctctccacat gcctcaacgc 360
tgttacttga atgccttccc tgagggaaga ggcccttgag tcacagaccc acagacgtca 420
ggaccatggg 430



<210> 42 

<211> 437 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> (1)...(437)
<223> n = A,T,C or G 



<400> 42 

ggcacgaggc gccctctgcc cccctcagag ggtctctcct ctcgaccccc aaattccccc 60 

agcatctcaa tcccttgcat ggggagcaag gcctcgagcc cccatggttt gggctccccg 120

ctggtggctt ctccaagact ggagaagcgg ctgggaggcc tggccccaca gcggggcagc 180

aggatctctg tgctgtcagc cagcccagtg tctgatgtca gctatatgtt tggaagcagc 240 

cagtccctcc tgcactccag caactccagc catcagtcat cttccagatc cttggaaagt 300 

ccagccaact cttcctccag cctccacagc cttggctcag tgtccctgtg tacaagaccc 360 

agtgacttcc aggctcccag aaaccccacc ctaaccatgg gccaacccag aacaccccac 420 

tctccaccac tgggcan 437 



<210> 43 

<211> 432 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(432)
<223> n = A,T,C or G 

<400> 43 

ggnncagtga ccaccaggac ctggtgtctg tgcacatcta catcacccag ctggctgaga 60 
agttcgacct caggaccact atgctgtaca tctgtgagcg gcacttccag aaggttctga 120 
accggagtct attcacaggc ctgcgctcca tcacccactt tggccgtccc ccctttgagc 180

ccttcttcaa ctccctgcag gaggtccacc cccaggtccg gaagatcggg gtgtttagct 240 

gtggcccccc tggcatgacc aagaatgtgg aaaaggcctg tcagctcatc aacaggcagg 300 

accggactca cttctcccac cattatgaga acttctaggc cccttcccgg gggttctgcc 360 

cactgtccag ttgagcagag gtttgagccc acacctcacc tctgttcttc ctatttctgg 420 

ctgcctcagc cc 432 



<210> 44 

<211> 436 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(436)
<223> n = A,T,C or G 



<400> 44 

ggcacgagcc gaggcgcgcg tgttccgtgg ccgcttccag ggccgcgcgg cggtgatcaa 60
gcaccgcttc cccaagggct accggcaccc ggcgctggag gcgcggcttg gcagacggcg 120
gacggtgcag gaggcccggg cgctcctccg ctgtcgccgc gctggaatat ctgccccagt 180
tgtctttttt gtggactatg cttccaactg cttatatatg gaagaaattg aaggctcagt 240
gactgttcga gattatattc agtccactat ggagactgaa aaaactcccc agggtctctc 300
caacttagcc aagacaattg ggcaggtttt ggctcgaatg cacgatgaag acctcattca 360
tggtgatctc accacctcca acatgctcct gaaacccccc cttgaacagc tgaacattgt 420
gctcatagac tntggg 436

<210> 45

<211> 300

<212> DNA

<213> Homo sapiens

<400> 45

tctctctctc tctctctcac agacactttt accccatata tacacataaa atgtgtgcgc 60
gagagagaga gagccctctc gctctatata tatccccgcg ggggggagat aaaaatatat 120
atccccacac tttatagggc gggctccccc ctctatcctg tgtgtagaga gaaatatata 180
tatatctgtg gggggagaga gagatctctc acccccccgc acacgcgagc tctttcttaa 240
gatgtgtgag cgcccccccc ctgtttttgt aaaaaagaga ggggtatata tattgggggg 300

<210> 46

<211> 191

<212> DNA

<213> Homo sapiens

<400> 46

caaaacaaaa ccatgttccc actggtgatg cctgtctgac acgttttggt atttagtagg 60
aaatgaaggg tcttcaagct tcgagagaac cttcaaaatt gtcacaattg ctgaaaacag 120
aatgaatcgg gaacattatc tcaatatttt gcataataga caacaccaca gtgttttggt 180
tccctgacct g 191

<210> 47

<211> 302

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature
<222> (1)...(302)
<223> n = A,T,C or G

<400> 47

gcccgggcgt gtgtgtatgt gtgtacacgc ccccgtgggc tctctgtcgc atcttgnnnn 60
nnnnnnnnnn nnnnnnnnnn nntgtannnn nnnnnncaca tagcgcgcgc gctcgcgcgc 120
acggagctat agagacacca ctctctctct gagatacacg cgcgcgcaca cactctgcgc 180
gcgcgcgctc ttctttgtct cgcgcgcgcg cccgctatgt ggagggtata tgtgggggaa 240
aatagcgagg tgtgcgcgca cccgcgcacg cgcgctctat atctctatat cttcagcgcg 300
cg 302

<210> 48 

<211> 411 

<212> DNA 

<213> Homo sapiens 



<400> 48 

ggcacgaggc ttgcggggca ttaggactag agggttggtg aaaattcaga cagaatgtaa 60
cttgacaaag agaagacagc aacaactgta acaattatct tatgaatatt tgcgaaactc 120
aaagggatct gattggtgac ctctgggctt tatcaaatta acatcacaac ttctagaaga 180
aagtcaacct tcatctttta caatagaaat catatgtttt gctaacccat tcctatttag 240
gctgaaaaca attaagagtt atgggtactt aaaaaaatca ttatgtttat aaaattagtg 300
atagaaggag catagtgttc tatacagtca cacacataca cttccttatt tcttttattt 360
aaactttgag taacatagca gtctatgttt gggtcagttt tccctttttt g 411



<210> 49 

<211> 408 

<212> DNA 

<213> Homo sapiens 



<400> 49 

ggcacgaggc acacaaagcc aagggcatac cctatagagt aaagctgcag ccaccctgtg 60
tctcatgtgc agctgaaata gtgatctgct tctgtcactg tcacatagac agccctgcat 120
gccccctgtc tcacacagtt tgtaatgaag acagctcctt ctcatctttc cataagcctg 180
agatacaagt tcagggactc agcaatgcac tttaggactg agctaggagg caaatatctg 240
aagcttgcta tgctgttctt tccattcctt ttccctctga aacacacaaa ataccaaagg 300
aacttacgca tcacaccact gagtcctcta actaatcata tgtgctcaga cacagctcaa 360
gcacacccct tagttaagag agaacctcca tatacattaa tttttttc 408



<210> 50 

<211> 407 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(407)
<223> n = A,T,C or G



<400> 50 

agagaaacat ccactcgaat tcggcacgag gacagggcag ggctagatct tttttctgca 60
cttggcctgt gacatactgt ctggtgtctg agaatcctcc cctacttctc tagttaatct 120
ccagagactt ctgtgactac ttaatcacaa aggaaatttt caggaatatt atcaaatact 180
attttagaaa aaaaaagaga agggatttga atgttttcag ttcagtttag ttatcnnnnn 240
nnnnnnnnnn ncccaaactc aagtatggag gcccccccct ctttaaaccc accaaaaaaa 300
ttttttgggg ttcagggtgg gttggccaac tacccaaacc cccaaagaaa atgggggtta 360
acccccttga aaaagttttc ttactttggg gggctgccct tgagccg 407



<210> 51 
<211> 312 
<212> DNA 
<213> Homo sapiens



<400> 51 

ccccgggggc gctctctttt tttttccccc caagtgagag agccccgcgc gcgtctctct 60
ctcgcatttt ttcgacaccc cccttgtgtg gggcggggcg cgcgtctgtg tgtgatacac 120
agaatgtgcg tggtgtgtct gagagacact cttcgcgctt gtgtgtgaga cacgagactt 180
tctcttttta gggggcgggg ggggagtttt atgtgtgcca catgttttct gtgtataaaa 240
agagcgcaca gagtgttttt tatatctgtg agagagacct ctctgtatat atacacgctc 300
agaggggaga gg 312

<210> 52

<211> 420

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature
<222> (1)...(420)
<223> n = A,T,C or G

<400> 52 

acgagggnnn nnaagcaccg cgggtacccc atgagggcct acaagctggc caccctggcc 60 

atgacccatc tcaacctgag ctacaatcag gacacacacc ctgccattaa tgatgttttg 120 

tgggcctgtg cgcttagcca ctcccttggt aaaaatgagc ttgcagctat aatacctctg 180 

gtggtcaaga gtgtcaagtg tgcaacggta ctgtcagaca ttttgcgcag atgcactctg 240 

accactcctg gcatggtggg acttcatggg aggaggaact ctggtaagct catgtcactg 300 

gacaaagccc ccttgaggca actcttggat gccacgatcg gggcctacat caacacaacg 360

cactcacggc tcacacacat cagtcctcgg cactatagtg agtttataga gttcctcagc 420 



<210> 53 

<211> 394 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature
<222> (1)...(394)
<223> n = A,T,C or G



<400> 53 

ggcacgaggt gtggatgaca gagcgagacc ctgcctcatn nnnnnnnnnn nnnnnnnnnc 60
ccccccnnnn nnnnnnnaaa aacccggttg ggccccggct gttctttagg gccctaaaaa 120
ttgccccaaa aaaaattggc cgggccctaa aaaaaccccg gttttttggg gagaattcaa 180
aaaagggtcg gtnnnnnnnn nntttttaaa cttccaaccg gcctcagggg gaaaaaacct 240
ggaaaactca atgggggttg gaacaaaatc aatatttggt cctaccggaa agcgttaaga 300
ttttaaacca gtaaaaatgg ccaannnnnn nnnnnnnnnn nnnnaacagg gcccccgggg 360
taagggctaa aaattttcag atttgaacct tttt 394



<210> 54 

<211> 390 

<212> DNA 

<213> Homo sapiens 



<400> 54 

ggcacgagat tttcttggca ataagcggac tctgggactc cggctcccta ccccaaactg 60 

aagcgcttcc gtgaacaccc ccgtcctccg tagggggagg ggagcaggcg ggatcctggg 120

tccctcataa gcactttggt tttaccgcct gcaacctcac tgtgcccgcc ccgcaccatg 180 

ccctagcccc aggtctagcc gggcccattg cagggggcag cacttggggg catctccggc 240 

acttgggtgg gaccaaggag atgccaccat agacctttcc ctcgccttct tcctccctag 300 

tccgggttcc attcttttca ccagcaccca tcgcccaagg ggtaccgagg gggggcaggg 360

ggtggtcaat tcaaacccaa cccccgctcg 390 



<210> 55 

<211> 280 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(280)
<223> n = A,T,C or G

<400> 55

tctctctctc tctctgcgcc cacacctctc tcannnnnnn nnngcacgtg tatatctnnn 60 

nnnnnnnnnn ttttttttag agagacatct cgcgcgtgtc tctctttttc ccgcccgccg 120

ctcttttctc gcgcgcgcgc gcaccccccc tgtgtggggc gcgcgctctc tttttttttg 180

tgcgcgcgan nnnnnnnnnt ctctctctgt ggcgnnnnnn nnnnnntctc ttattttata 240

ttttgggggg cggggggcct cccctccccc ctgtgtgcct 280

<210> 56 

<211> 398 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(398)
<223> n = A,T,C or G 



<400> 56 

ggcacgaggt ccacctcagc tcagcaatct catgccggtt ggcaattagt cagcataagc 60
cgatgcctgc ccatcagttc tttactctga ggtgttagag tggaataaaa atataaatac 120
ttacnnnnnn nnnnnnnnca atacccaacc ccctcccatt nnnnnnnnnn nnngcccgcc 180
cccctaaaat tcatggagag gcctatttcg tagccagcca ctatataaac cctgctggtt 240
gggcggnnnn nnnnnnnngt gaagggggga aaaaaaagcc tttttttgaa aaaattagtc 300
attttttgct ttttttggac acattttgcg ggacaaagaa ccctgtaaaa cccccctatt 360
cnnnnnnnnn nnnnnnaacc tcaacgaggg gggggcgg 398



<210> 57 

<211> 386 

<212> DNA 

<213> Homo sapiens 



<400> 57 

ggcacgagat tttcttggca ataagcggac tctgggactc cggctcccta ccccaaactg 60
aagcgcttcc gtgaacaccc ccgtcctccg tagggggagg ggagcaggcg ggatcctggg 120
tccctcataa gcactttggt tttaccgcct gcaacctcac tgtgcccgcc ccgcaccatg 180
ccctagcccc aggtctagcc gggcccattg cagggggcag cacttggggg catctccggc 240
acttgggtgg gaccaaggag atgccaccat agacctttcc ctcgccttct tcctccctag 300
tccgggttcc attcttttca ccagcaccca tcgcccaagg ggtaccgagg gggggcaggg 360
gggggtcaag tccaggccca cccccg 386



<210> 58 
<211> 202 
<212> DNA 
<213> Homo sapiens

<400> 58

cactttttct atatgaatat cttggccgta tcatagactc aaaaaagaaa ttatgcaagt 60
tctttctgcc cccacctgcg ccaggggaga agtttacctt cgggaactcc agagttaaag 120
cagttgtggt gataattttt tatgctgaac acaccacgat ataaaaaaca acattcacgt 180
gctttatttt tgttatgtgt tt 202



<210> 59 

<211> 394 

<212> DNA 

<213> Homo sapiens

<400> 59

ggcacgagtc tgcttctgtc actgtcacat agacagccct gcatgccccc tgtctcacac 60
agtttgtaat gaagacagct ccttctcatc tttccataag cctgagatac aagttcaggg 120
actcagcaat gcactttagg actgagctag gaggcaaata tctgaagctt gctatgctgt 180
tctttccatt ccttttccct ctgaaacaca caaaatacca aaggaactta cgcaacacac 240
cactgagtcc tctaactaat catatgtgct cagacacagc tcaagcacac cccttagtta 300
agaaagaacc tccatataca ttaatttttt tctgcctaaa aataaaattg cgttgtggca 360
gcaatttgga aactacagca aagtctccaa aaaa 394



<210> 60 

<211> 246 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(246)
<223> n = A,T,C or G

<400> 60 

cccctccttt tttaggcctg aatacaaagt agaagatcac tttccttcac tgtgctgaga 60 

atttctagat actacagntc ttactcctct cttccctttg ttattcaggg tgaccaggat 120 

ggcgggaggg gatctgtgtc actgtaggta ctgtgcccag gaaggctggg tgaagtgacc 180

atctaaattg caggatggtg aaattatccc catctgtcct aatgggctta cctcctcttt 240 

gccttn 246

<210> 61 

<211> 395 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(395)
<223> n = A,T,C or G 



<400> 61 

ggcacgagct tgcttccctc tcaccctctg cagtttccnn nnnnnnnnnn nnnnnnnnnn 60
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 120
nnnnnnnnnn nnnnnncttc catatcgtaa actgccttgg aaccaattac cactaccagg 180
gagacaaact attgcttaga ggatgctgac aggagcagca tgccaaaatt ggaagaagga 240
gaaagtttaa gctctcctca ctatgagttt tcaagtataa aagacttttt cttccacgat 300
tttgagaaca actgaggact cttgtgacca ggacaacagg gaagcttgca gcaagatagc 360
tccaggttgg attcatgctt cgcaccccaa aggct 395



<210> 62 

<211> 387 

<212> DNA 

<213> Homo sapiens 



<400> 62 

ggcacgaggc ttgcggggca ttatgactag agggttggtg aaaattcaga cagaatgtaa 60
cttgacaaag agaagacagc aacaactgta acaattatct tatgaatatt tgcgaaactc 120
aaagggatct gattggtgac ctctgggctt tatcaaatta acatcacaac ttctagaaga 180
aagtcaacct tcatctttta caatagaaat catatgtttt gctaacccat tcctatttag 240
gctgaaaaca attaagagtt atgggtactt aaaaaaatca ttatgtttat aaaattagtg 300
atagaaggag catagtgttc tatacagtca cacacataca cttccttatt tcttttattt 360
aaactttgag taacatagca gtctatg 387

<210> 63
<211> 401
<212> DNA
<213> Homo sapiens

<400> 63

ggcacgaggg aaactgtatg acaggagaat gaatcaggtt tggggctcaa ggtgccggcc 60
actgggaaaa acagctgccc cgagttgcaa aactctgggt cctatatgta taaactatgc 120
cctgaggaag gaatctcagg cgtatcttag gagaaaatgt tctagcttgg gaaacaaaca 180
caacaggacc gtgaatccaa atatttcaag tgggtttaga ggactggagt tctaaacgct 240
gcttttactg taagtgatca cgccccggaa tgtgctgaag aaaggaaaat gagccagtat 300
cggcgaggac tatgggcaag gaaaacgaga gtgtgcgatg tgtcaaagca agacatctgt 360
gtatagtaat ataaccaagt aatagatagt catagaatca a 401

<210> 64 

<211> 274 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature
<222> (1)...(274)
<223> n = A,T,C or G 



<400> 64 

cacgcacccg cctgtgtgtg tgcgcacaca cgctccctct ctctatagac agacacacac 60 

tgcgcgctcg ctctctcttt tgtgtgcgct ctccgtgctc cccccctctc tctctttttt 120 

ctctatatnn nnnnnnnnnn nnnnntctga gagctcgcgc gctcagcgtt ctattcacac 180

gcgcgttttt tttatatata tattttgtgc gcgccggggg gggcgcacac actctctctt 240 

ttttgtgggt tcgctgtccg cgtccctcct tttg 274 



<210> 65 

<211> 279 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(279)
<223> n = A,T,C or G 



<400> 65 

cccttttttt tatacacccc cccttgtctg tctgttttgt gtgtctgccc cccttctctc 60 

gttgtgatct ccctctctct tttttctccc cccgcgctct ctctctcttg cggggaggct 120

cacatacccc ctctctctct cttttttgaa ccacacattc cgtttctctt ttttttatct 180 

ctacccctct ctcgtctgta ccccccacan nnnnnnnnnn nnnnnnnnnn nnnagagtag 240

agttgcgttc cccactctcc nnnnnnnnnn gtggggtgc 279 



<210> 66 

<211> 311 

<212> DNA 

<213> Homo sapiens 



<400> 66 

caaaacaaaa attaaaaatg accccccttt aaaattttag ggggtccatt tttaaaaacc 60
ttaacagttt aaaggttctt ggtcagtttg gggaacccca ccttgagatg ggagcaaaaa 120
aggggatttt tttccaacat agcgagcggt ttagattttt tttgtcccgt tagagttgcc 180
ctgtgcacca cgccaaaacc tccagaggtc ttcttttttt acacaccctg tctgggggtg 240
tttctcagaa gattaacaca gcgcctgggg gtttaaggga ggggtgacct ccgcaggaca 300
ttatggggct t 311



<210> 67 
<211> 386 
<212> DNA 
<213> Homo sapiens

<400> 67

ggcacgaggg aatctcaggc gtatcttatg agaaaatgtt ctagcttggg aaacaaacac 60
aacaggaccg tgaatccaaa tatttcaagt gggtttagag gactggagtt ctaaacgctg 120
cttttactgt aagtgatcac gccccggaat gtgctgaaga aaggaaaatg agccagtatc 180
ggcgaggact atgggcaagg aaaacgagag tgtgcgatgt gtcaaagcaa gacatctgtg 240
tatagtaata taatcaagta atagatagtc atagaatcaa gctgatgtat ttgggagggg 300
ccgcgggagg atgaggcaac tcccatcaga ttagaaagat gttaacactg taacaaaagt 360
ggggctcgag gaaggggaaa agcgca 386
<210> 68

<211> 396

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1)...(396)
<223> n = A,T,C or G 



<400> 68 

ggcacgagga ggcagctgcc tttgtttgcc atggatgggt aggggctgca ctgagcagca 60
ccggtgttct tcatccggct gcacccccaa cagagctctt tcttccccag atccctttta 120
cagttggatt ctccctcttg gatctggctc tgccttagtc cgacctagag ggatcagctt 180
cgcccacgcc cactctcacc cggaaccttt catctcttat tgaagccttt taggcccatt 240
gggatgttca ttagaactct gaaaactaca gttctcccct ttatgaggac tgcaccacag 300
ctcgccctct cctgggttcc gcctgggtgc agagtgagcc catgggacag ccctctgaaa 360
ttatactgct tacaaccatg ctgagtctgc aaggan 396



<210> 69 
<211> 397 
<212> DNA 
<213> Homo sapiens

<400> 69

ggcacgagtc ttagtcaaca tggacaacaa catcattcag cattacagca accacgtcgc 60
cttcctgctg gacatggggg agctggacgg caaaattcag atcatcctta aggagctgta 120
aggcctctcg agcatccaaa ccctcacgac ctgcaagggg ccagcaggga cgtggcccca 180
cgccacacac aacctctcca catgcctcag cgctgttact tgaatgcctt ccctgaggga 240
agaggccctt gagtcacaga cccacagacg tcagggccag ggagagacct agggggtccc 300
ctggcctgga tccccatggt atgcttgaat ctgctccctg aacttcctgc cagtgcctcc 360
ccgtacccca aaacaatgtc accatggtta ccaccta 397



<210> 70 
<211> 394 
<212> DNA 
<213> Homo sapiens

<400> 70

ggcacgagcc aaacctagca caaaacgggg ttcacaagcc atggtcgggg tccggggggg 60
acagaaatgg attttcttgg caataagcgg actctgggac tccggctccc taccccaaac 120
tgaagcgctt ccgtgaacac ccccgtcctc cgtaggggga ggggagcagg cgggatcctg 180
ggtccctcat aagcactttg gttttaccgc ctgcaacctc actgtgcccg ccccgcacca 240
tgccctagcc ccaggtctag ccgggcccat tgcagggggc agcacttggg ggcatctccg 300
gcacttgggt gggaccaagg agatgccacc atagaccttt ccctcgcctt cttcctccct 360
agtccgggtt ccattctttt caccagcacc catc 394



<210> 71 
<211> 389 
<212> DNA 
<213> Homo sapiens

<400> 71

ggcacgagga aagttaagca tctacaggtt atggctttgg gagttccaat atcagtctat 60
cttttattca acgcaatgac agcactgacc gaagaggcag ccgtgactgt aacacctcca 120
atcacagccc agcaaggtaa ctggacagtt aacaaaacag aagctgacaa catagaagga 180
cccatagcct tgaagttctc acacctttgc ctggaagatc ataacagtta ctgcatcaac 240
ggtgcttgtg cattccacca tgagctagag aaagccatct gcaggtgtct aaaattgaaa 300
tcgccttaca atgtctgttc tggagaaaga cgaccactgt gaagcctttg tgaagaattt 360
tcatcaaggc atctgtagag atcagtgag 389



<210> 72 
<211> 396 






WO 01/72781 



PCT/US01/09952 



<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(396)
<223> n = A,T,C or G 



<400> 72 

ggcacgaggc ctggccccac agcggggcag caggatctct gtgctgtcag ccagcccagt 60 

gtctgatgtc agctatatgt ttggaagcag ccagtccctc ctgcactcca gcaactccag 120 

ccatcagtca tcttccagat ccttggaaag tccagccaac tcttcctcca gcctccacag 180 

ccttggctca gtgtccctgt gtacaagacc cagtgacttc caggctccca gaaaccccac 240 

cctaaccatg ggccaaccca gaacacccca ctctccacca ctggccaaag aacatgccag 300 

cagctgcccc ccatccatca ccaactccat ggtggacata cccattgtgc tgatcaacgg 360 

ctgcccagaa ccagggtctt ctccacccca gcggan 396 



<210> 73 

<211> 386 

<212> DNA 

<213> Homo sapiens 



<400> 73 

ggcacgaggc cacctgttgc cctaacaccc tgtctgactc tctcccgctg cagcagccag 60 

tccctcctgc actccagcaa ctccagccat cagtcatctt ccagatcctt ggaaagtcca 120

gccaactctt cctccagcct ccacagcctt ggctcagtgt ccctgtgtac aagacccagt 180 

gacttccagg ctcccagaaa ccccacccta accatgggcc aacccagaac accccactct 240 

ccaccactgg ccaaagaaca tgccagcagc tgccccccat ccatcaccaa ctccatggtg 300 

gacataccca ttgtgctgat caacggctgc ccagaaccag ggtcttctcc accccagcgg 360 

accccaggac accagaactc cgttca 386 



<210> 74 

<211> 390 

<212> DNA 

<213> Homo sapiens 



<400> 74 

ggcacgagct cagatccggg gactgcggat aaatggcctt aggccgcggg cagcgagatg 60
ttgcgttccg gtgtgggtgt gggtgtgcct ccgacggcgt ctcggtgcca gtgtcgaggt 120
tctttctgct tagctacccg gagccgacta cggaggagga cacctgagtt tacgtctctt 180
ccatctgctg ctcgcctcag ctgcctgggt ccccgacgag agccaggtga cacttaactc 240
cgccatctgc gttttgagca ctgttctcat aatggagttt cctgatttgg ggaagcattg 300
ttcagaaaag acttgcaagc agctagattt tcttccagta aaatgtgatg catgtaaaca 360
agatttctgt aaagatcatt ttccatacgg 390

<210> 75
<211> 399
<212> DNA
<213> Homo sapiens

<400> 75

ggcacgagaa atggccttag gccgcgggca gcgagatgtt gcgttccggt gtgggtgtgg 60
gtgtgcctcc gacggcgtct cggtgccagt gtcgaggttc tttctgctta gctacccgga 120
gccgactacg gaggaggaca cctgagttta cgtctcttcc atctgctgct cgcctcagct 180
gcctgggtcc ccgacgagag ccaggtgaca cttaactccg ccatctgcgt tttgagcact 240
gttctcataa tggagtttcc tgatttgggg aagcattgtt cagaaaagac ttgcaagcag 300
ctagattttc ttccagtaaa atgtgatgca tgtaaacaag atttctgtaa agatcatttt 360
ccatacgctg cacataagtg tccgtttgca ttccagaag 399

<210> 76
<211> 386
<212> DNA
<213> Homo sapiens

<400> 76

ggcacgagca aaggctcgca gcggccagaa acccggctcc gagcggcggc ggcccggctt 60
ccgctgcccg tgagctaagg acggtccgct ccctctatcc agctccgaat cctgatccag 120
gcgggggcca ggggcccctc gcctcccctc tgaggaccga agatgagctt cctcttcagc 180
agccgctctt ctaaaacatt cataccaaag aagaatatcc ctgatggatc tcatcagtat 240
gaactcttaa aacatgcaga agcaactcta ggaagaggga atctgagaca agctgctatg 300
ttgcctgagg gagaggatct caatgaatgg agtgctgcga acacctgggg attcttttac 360
cagcaacaac atggtttttg ggaact 386

<210> 77

<211> 395

<212> DNA

<213> Homo sapiens

<400> 77

ggcacgaggc catctccaaa tactgcggtt gttcagaagc tcttagtttg tgggctgtcc 60
ttgttatttc acttgaccat ctgtacaaca ttacctgtgg agtacaacat tgatgagcat 120
tttcaagcta cagcttcgtg gccaacaaag attatctatc tgtatatctc tcttttggct 180
gccagaccca aatactattt tgcatggacg ctagctgatg ccattaataa tgctgcaggc 240
tttggtttca gagggtatga cgaaaatgga gcagctcgct gggacttaat ttccaatttg 300
agaattcaac aaatagagat gtcaacaagt ttcaagatgt ttcttgataa ttggaatatt 360
cagacagctc tttggctcaa aagggtgtgt tatga 395

<210> 78

<211> 389

<212> DNA

<213> Homo sapiens

<400> 78

ggcacgaggc aggccgggat gttcgtcctg gtggaaatgg tggacaccgt ccggatcccc 60
ccttggcagt ttgagaggaa gctcaacgac tccattgccg aggagctgaa caagaagttg 120
gccaacaagg tcgtgtacaa cgtgggactc tgcatttgtc tgtttgatat caccaaactg 180
gaggatgcct atgtattccc tggggatggc gcatcacaca ccaaagtcca ttttcgctgc 240
gtggtgtttc atccattcct agatgagatt ctcattggga agatcaaagg ctgcagccca 300
gaaggagtgc acgtctctct aggcttcttc gatgacattc tcatcccccc agagtcactg 360
cagcagccag ccaagttcga cgaagcgga 389

<210> 79

<211> 365

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature
<222> (1)...(365)
<223> n = A,T,C or G



<400> 79 

ggcacgagaa aacatttcat cttgattttt attaaggtga tatgtatgtt acttaacagc 60
tgtataatac acatttgcat gcattaggaa gttttttttg ggttttattc atcctgtagt 120
gatgtatctg tgacctcaac gagtaggcac ttctgtactg tactggtttc ttaaagtttc 180
ttttatcccg cccccacccc caacctcagc ctcaagtatg taannnnnnn nnnnnnnnnn 240
nnnnnnnnnn nnnnnnnnnn nnnnnaaaac aaagccccgt tttgtcccca ggctggataa 300
caggggcgga atctgggtta attgaaccct ttgcttttgg ggttaaggca attttcctgc 360
ctcac 365



<210> 80 

<211> 376 

<212> DNA 

<213> Homo sapiens 

<400> 80 

ggcacgagct ggaaaccagc ccctaagctg ctttcacctt cggcccattt ctcactagcc 60
agcctctccc acggcctccc cagtttcttc aaatacaccc cctccactat tcaccatact 120
gccaccgtga tttatttaca actttttgtc cggattacct cagtagcctt ctaattgtcc 180
cctttgcatc taaagtagcc cctctcatcc cccaaatctt accatgtcac tcttctacat 240
aattctggct ttccatgacc cataaaccac atttctcaag tgtgctctat gctggcttga 300
atatgttaat gatcttaatt ctacttttag tgcaattttc ttagagctgg catcactttc 360
atcatgacgt gagaac 376



<210> 81 
<211> 384 
<212> DNA 
<213> Homo sapiens

<400> 81

ggcacgagag gattgtgtga aattgtgcaa atgcatgaat gtgggctggg atagtaaaag 60
ggagggcccc ggagcagccc acctggggtc ctatctagta gacgcgcccg gtgcccaccc 120
attgctgtga tgccagcagc ccactgcaag catcctcttc ctttccaagg ttctgtctgg 180
tacatgaata ggtgtggcag gggtgggggc tcctgaagac caactagggg tactagggac 240
cttagactct tgcgagagcc tgcaccccat atcaggtggg gtcaatagat aaatacccct 300
gcctccttgc cccttagttc tggtgtggtg ggcaagtcag aggaactgtt cttctcacac 360
tttcacgtgc tctcggtgga gatc 384



<210> 82 
<211> 383 
<212> DNA 
<213> Homo sapiens

<400> 82

ggcacgagca aaggctcgca gcggccagaa acccggctcc gagcggcggc ggcccggctt 60
ccgctgcccg tgagctaagg acggtccgct ccctctagcc agctccgaat cctgatccag 120
gcgggggcca ggggcccctc gcctcccctc tgaggaccga agatgagctt cctcttcagc 180
agccgctctt ctaaaacatt caaaccaaag aagaatatcc ctgaaggatc tcatcagtat 240
gaactcttaa aacatgcaga agcaactcta ggaagtggga atctgagaca agctgttatg 300
ttgcctgagg gagaggatct caatgaatgg attgctgtga acaactgggg atttctttac 360
caggatcaca atggtaatat ggg 383



<210> 83 

<211> 358 

<212> DNA 

<213> Homo sapiens 



<400> 83 

ggcacgagca gggccgcgcg gcggtgatca agcaccgctt ccccaagggc taccggcacc 60
cggcgctgga ggcgcggctt ggcagacggc ggacggtgca ggaggcccgg gcgctcctcc 120
gctgtcgccg cgctggaata tctgccccag ttgtcttttt tgtggactat gcttccaact 180
gcttatatat ggaagaaatt gaaggctcag tgactgttcg agattatatt cagtccacta 240
tggagactga aaaaaactcc ccagggtctc tccaacttag ccaagacaat tgggcaggtt 300
ttggctcgaa tgcacgatga agacctcatt catggtgatc tcaccacctc caacatgc 358



<210> 84 
<211> 338 
<212> DNA 
<213> Homo sapiens

<400> 84

aagatggctg agagggacag aatgctttat tttggagaga aacaatgttc taggtcaaac 60
tgagtctacc aaatgcacac tttcacaatg ggtctagaag aaatctggac aagtcttttc 120
atgtggtttt tctacgcatt gattacatgt ttgctcacag atgaagtggc cattctgcct 180
gcccctcaga acctctctgt actctcaacc aacatgaagc atctcttgat gtggagccca 240
gtgatcgcgc ctggagagac agtgtactat tctgtcgaat accaggggga gtacgagagc 300
ctgtacacga gccacatctg gattcccagc agctggtg 338



<210> 85 
<211> 475 
<212> DNA

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(475)
<223> n = A,T,C or G 



<400> 85 

gtcgctcaat aggcaggagt ccatcgattc gaattcggca cgagnnnnnn nnnnnnnnnn 60 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 120

nnnnnnnnnc gctccactgt gcactcctga cacatacttt ccccgctaca ctctctattc 180

tccccctctt gtgttctctc tctatagcgg tagatagaga ggcctgtgtg tagataataa 240 

acgtgtgtgt gtgtgtaaga aaggagacac aaacacgccc acnnnnnnnn nnnttggggc 300 

ctttttttct tttgagccct ttggggaaaa aacccgggga aaacagccca tacccactat 360 

ttggggcgcg ccaaaaaacc ttctttaaaa aaaatgtgtt aaatgttaaa ttttttagga 420 

acannnnnnn nnnngcaaaa aatagcaccc caaaagcagg ggttttacat ttttg 475 



<210> 86 

<211> 467 

<212> DNA 

<213> Homo sapiens 



<400> 86 

gagcgatttt ctgcaggatt ctatcgattc gaattcggca cgagccatgg tctcagtgag 60 

ggctggaatt tacagagaag tttggccagg gggtccacca tgctgccagt cagtttggga 120

aggaaacaga gaagctcggc catggggtcc accatggggt taatgaggcc tggaaggaag 180 

cagagaagtt tggccagggt gtccaccatg ctgcctcgca ggtggggaag gaggaagaca 240

gagtggtcca aggcctccat catggcgtta gtcaggctgg aagggaggcg gggcagtttg 300 

gccacgacat tcaccacaca gcagggcagg ctgggaaaga gggagacata gcagttcatg 360 

gtgtccaacc tggggtccac gaggccggga aggaggcagg gcaatttggc cagggagttc 420

accataccct tgaacaggcc gggaaggaag caaacaaagc ggtccag 467 



<210> 87 

<211> 449 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(449)
<223> n = A,T,C or G

<400> 87

cggggtggga aaccngannt tnannaancg gacggattct cccgttccga atagcctttt 60
acagaagatt cttcacagct atgtgcctga agagatcang gatggaaatc aagttcgagt 120
tacctcatgg gatggcagga aatggggaga actggagggg gacacctatg accgggtgct 180
ggtggatgtg ccctgtacca cagaccgcca ctcccttcat gaggaggaga acaacatctt 240
taagcggtca aggaagaagg agcgacagat attgcctgtg ctgcaagtgc agcttcttgc 300
ggctggactc cttgccacca aaccaggagg ccatgttgtc tattctacct gctcactctc 360
acacttacag aacgagtatg tggtgcaagg tgccattgag ctcctgggca atcaatacag 420
catccaggta caggtggaag atctgactg 449

<210> 88

<211> 439

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature
<222> (1)...(439)
<223> n = A,T,C or G

<400> 88

gtagtgtatg tgcagcctcc catcgattcg aattcggcac gagatcccct cttatatgat 60
gccccagccc aggagagata aaagcatcag caccatgaga ttcacctgcc tctggtcgtn 120
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 180
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240
nnnnnnnnnn actcttagac agcaaaaatg ctttctccca gtcttgttcc cttgttctca 300
gttcccaccc tgcctggata actactgttc ttggtttnnn nnnnnnnnnn nnnnnnnnnn 360
nagtctcgta ccagattcaa aaatcagtca actacttcaa aaacaatgac atgctggcta 420
cttagataga agaggaggc 439



<210> 89 

<211> 436 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature
<222> (1) . . . (436)
<223> n = A,T,C or G



<400> 89 

ggcacgagca tcaaatagta aatatagatc ttatgctgga aatgtcaacc tccctggcag 60 

ctgtaacgcc catcattgaa agggaaagcg gaggacacca ttatgttaat atgactttac 120 

ctgtcgatgc agttatatct gttgctccag aagaaacatg gggaaaagtt cgtaaactcc 180 

tggttgatgc aattcataat caactaactg acatggaaaa atgtattttg aaatatatga 240

aaggaacatc tattgtggtc cctgaaccac tgcacttttt attaccaggg aaaaaaaatc 300 

ttgtaacaat ttcatatcct tcaggaatac cagatggcca gctgcaggcc tataggaagg 360 

agttacatga tcttttcaat ctgcctcacg acagacccta tttcaaaagg tctaatgctt 420 

atcactttcc agatgn 436 



<210> 90 

<211> 437 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (437) 
<223> n = A,T,C or G 



<400> 90

ggcacgagag atcatgcact accacatgca gcacgagcag taccggcagg tcatcagcgt 60

gtgtgagcgc catggggagc aggacccctc cttgtgggag caggccctca gctacttcgc 120

tcgcaaggag gaggactgca aggagtatgt ggcagctgtc ctcaagcata tcgagaacaa 180

gaacctcatg ccacctcttc tagtggtgca gaccctggcc cacaactcca cagccacact 240

ctccgtcatc agggactacc tggtccaaaa actacagaaa cagagccagc agattgcaca 300

ggatgagctg cgggtgcggc ggtaccgaga ggagaccacc cgtatccgcc aggagatcca 360

agagctcaag gccagtccta agattttcca aaagaccaag tgcagcatct gtaacagtgc 420

cttggagttg ccctcan 437



<210> 91 
<211> 437 
<212> DNA 
<213> Homo sapiens



<400> 91

ggcacgagct tcagtcttat gtcatttact ctttaggaca acctcttgaa aaactaaatc 60

atttctttga aggtgttgaa gctcgcgtgg cacagggcat aagggaggag gaagtaagtt 120

accaacttgc atttaacaaa caagaacttc gtaaagtcat taaggagtac cctggaaagg 180

aagtaaaaaa aggtctagat aacctctaca agaaagttga taaacattta tgtgaagaag 240

agaacttact tcaggtggtg tggcactcca tgcaagatga atttatacgc cagtataagc 300

actttgaagg tttgatagct cgctgttatc ctggatctgg tgttacaatg gaattcacta 360

ttcaggacat tctggattat tgttccagca ttgcacagtc ccactaaacc ttgtgaaaga 420

agaaaagata actgaat 437



<210> 92

<211> 427

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature
<222> (1) . . . (427)
<223> n = A,T,C or G



<400> 92 

aacggctctt ctncttttga ggagcccatc gagtcgaatt cggcacgagg cgagtctctg 60 

ggtcgcgacg ggaaggagtg aaacacctct ctgcgcctgc gcgctccgtg cctgcgaagc 120 

aaacccggcc tcaccttttc ctgcccgaag cagaagattc tcgcaggcct ggtttctccc 180 

tccagaagac cccccaccca aatcctctgt agctcctggg agtgccctga cccctgctgc 240 

caccgtcctt cagagagcaa cggaagagct tcccggaggg cgaggaaaag agggaaagta 300 

gccagcaatg tcgaacgcag tgtataataa gatgtggcat cagacccaag aagccctcgg 360 

tgctttactc gatgaagagc ctcagacgat gattgaacca cacagaaatc aggttttcat 420 

ctttcaa 427



<210> 93 

<211> 429 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature
<222> (1) . . . (429) 
<223> n = A,T,C or G 



<400> 93 

gtgacgatcc catcattcaa ttcggcacga gctcacagcc aaagttcctt ctgcccccag 60 

gctgagagtg cttgatacac ccttgaatcc cctcttatat gatgccccag cccaggagag 120

ataaaagcat cagcaccatg agattcacct gcctctggtc gttagggaac aatggaggcc 180 

tgcgatttgg agttaaactc tcagtgatct ctgtgttgac aacaccaaag ctagaggaat 240

ccagtaggat gtgggcatgg ttttcccgga aggctgactg agcagttctg caaatgtttg 300 

caagtacagg gcagaatttc atccagcctc agaaccttga gccaagactc agcatcagca 360 

aagccaaaag tttcatttct tcgactgtgg gagtgctagt cccaaccttt agatggccat 420 

tcagttnta 429 



<210> 94 

<211> 421 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (421) 
<223> n = A,T,C or G 



<400> 94 

ggcacgagat tatttacttg gtgtgtggtc accactgttt tttaaatgag tgttttcatt 60 

tgtatcaaac tggacctgct ttcctcaagg attgcccaaa aggagacaca aatttactaa 120 

acacttatca ataatagaac accgtgctag gcaatttcca tatactatta atttaatcct 180 

cacaataact ttggaagaca gaaagtattt tctctgannn nnnnnnnnnn nnnnnnnnnn 240 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300 

nnnnnnnnnn atcctctgtc tccaaagcct gtacttcatt caggacactt tcccccacat 360 

ttagaaaagc tgtaattatc ttccagtgag acagcatagc acatgtgatc actgtccctt 420 

c 421



<210> 95 
<211> 421
<212> DNA 
<213> Homo sapiens 



<400> 95 

ggcacgagat gagaagataa aattcagcgt tggcctttag actttgccat ccttaaggag 60 

tgatggaagc caagtgaaca agcctcagtg acacaagtca aattcatagt ttcactctgg 120 

gttttttgtt gttgtgtggt tattattctc actacagaaa gactgagttt catgctcctg 180

gctatgtcag atgtgaattt tcatgggtaa ctggacagtt aacaaaacag aagctgacaa 240 

catagaagga cccatagcct tgaagttctc acacctttgc ctggaagatc ataacagtta 300

ctgcatcaac ggtgcttgtg cattccacca tgagctagag aaagccatct gcaggtgttt 360 

tactggttat actggagaaa ggtgtctaaa attgaaatcg ccttacaatg tctgttctgg 420 

a 421



<210> 96
<211> 418
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1) . . . (418)
<223> n = A,T,C or G



<400> 96 

tggatccatc gattcaattc ggcacgaggt tatttttaag aacttttgct tactatattg 60

gatttacctg cggtgtgagt agctttaaat gtttgtgttt atacagataa gaaatgctat 120

ttctttctgg ttcctgcagc cattgaaaaa cctttttcct tgcaaattat aatgtttttg 180

atagattttt atcaactgtg ggaaaccaaa cacaaagctg ataacctttc ttaaaaacga 240 

cccagtcaca gtaaagaaga cacaagannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnng 418 



<210> 97 

<211> 418 

<212> DNA 

<213> Homo sapiens 



<400> 97 

atgctacatt gctactttgt tgcattgatc gccagacgac cactagattc gaaactgagc 60 

gcgagatgat gaatctgtgt gttatgaaaa tgcatgctac cagtataaaa tgacattctc 120

tattaataac atctgcggtg cgacacacat aattgtccca atttttaata ttgatgggga 180 

gcatgaagca tttttttaat gtgttggcag gccccattaa atgcataaac tgcataggac 240 

tcatgtggtc tgaatgtatt ttagggcttt ctgggaattg tcttgacaga gaacctcagc 300

tggacaaagc agccttgatc tgagtgagct aactgacaca atgaaactgt caggcatgtt 360 

tctgctcctc tctctggctc ttttctgctt tttaacaggt gtcttcagtc agggaggg 418 



<210> 98 

<211> 417 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (417) 
<223> n = A,T,C or G 



<400> 98 

catcgattcg aattcggcac gaggccaagt ggacaggcca tagcccccac agactggagg 60 

gacgcggcta gggaatgtcc cacagagtgg ccagttatcc ctgagagaaa gagcaggttt 120

tagcggagac tctgaggctg ctttagaata tggtgggtgt gtggggcaaa agggacaccc 180

aggggtgtat caagaggtca tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnn 417



<210> 99 

<211> 416 

<212> DNA 

<213> Homo sapiens 



<400> 99

ggcacgagct acctcagccc tgctccagga gacaccagca gctgggccag tggccctgag 60

agatggcccc gaagggagca tgtggtgaca gtcagcaaga ggaggaacac atctgtggac 120

gagaactatg agtgggactc agaattccct ggggacatgg aattgctgga gactttgcac 180

ctgggcttgg ccagctcccg gctcagacct gaagctgagc cagagctagg tgtgaagact 240

ccagaggagg gctgcctcct gaacactgcc catgttactg gccctgaggc ccgctgtgct 300

gcccttcggg aggaattcct ggccttccgc cgccgccgag atgctactag ggctcggcta 360

ccagcctatc gacagccagt cccccacccc gaacaggcca ctctgctgtg aacatt 416



<210> 100
<211> 417
<212> DNA
<213> Homo sapiens



<400> 100

ggcacgaggg aaaatgtagg ctaccagtag aaaatgacat tctctattaa taagatctga 60

ggtgcgacac acataattgt cccaattttt aagattgatg gggagcatga agcatttttt 120

taatgtgttg gcaggcccca ttaaatgcat aaactgcata ggactcatgt ggtctgaatg 180

tattttaggg ctttctggga attgtcttga cagagaacct cagctggaca aagcagcctt 240

gatctgagtg agctaactga cacaatgaaa ctgtcaggca tgtttctgct cctctctctg 300

gctcttttct gctttttaac aggtgtcttc agtcaaggag gacaggttga ctgtggtgag 360

ttccaggaca ccaaggtcta ctgcactcgg gaatctaacc cacactgtgg ctctgat 417



<210> 101
<211> 412
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1) . . . (412)
<223> n = A,T,C or G



<400> 101

ggcacgagga aagtaaacgt gtatctcttg ttcattttta tagaactttt gcatactata 60

ttggatttac ctgcggtgtg actagcttta aatgtttgtg tttatacaga taagaaatgc 120

tatttctttc tggttcctgc agccattgaa aaaccttttt ccttgcaaat tataatgttt 180

ttgatagatt tttatcaact gtgggaaacc aaacacaaag ctgataacct ttcttaaaaa 240

cgacccagtc acagtaaaga agacacaaga nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nn 412



<210> 102 
<211> 414 
<212> DNA 
<213> Homo sapiens 



<400> 102

ggcacgaggt cttgctcaca tgttgtacta ctctctcctg gatgtcactt gtcacctcta 60

ccagccctcc tttctccaga tggcttcttc ataaccacca ggtcagaaga ggatccgttc 120

caatgatttt cctaaaacaa tggaagtgtt ttccaaagag cttataaggc attgtaggat 180

ctggcctgcc ctgactccac tttaccagaa ccatctgctg ctcttctctc ttgtgttact 240

caaggtatta gctgctgtgg caaatcaact ctgaaatctc cgtgacttaa tacaagagag 300

gtttatttct tactcacgct gggtgcactg ccacttggta acagaggagc tatggaaact 360

tgagacctaa gcagaaatga gttcaataat attgctacac tctaggactt tctc 414

<210> 103
<211> 410
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1) . . . (410)
<223> n = A,T,C or G



<400> 103

ggcacgagga agagccggga ggatgtattg gttgttagga aaatgtaggc taccagtaga 60

aaatgacatt ctctattaat aagatctgag gtgcgacaca cataattgtc ccaattttta 120

agattgatgg ggagcatgaa gcattttttt aatgtgttgg caggccccat taaatgcata 180

aactgcatag gactcatgtg gtctgaatgt attttagggc tttctgggaa ttgtcttgac 240

agagaacctc agctggacaa agcagccttg atctgagtga gctaactgac acaatgaaac 300

tgtcaggcat gtttctgctc ctctctctgg ctcttttctg ctttttaaca ggtgtcttca 360

gtcaaggagg acaggttgac tgtggtgagt tccaagacac ccaaggctan 410



<210> 104 
<211> 411 
<212> DNA 
<213> Homo sapiens 



<400> 104

ggcacgagat acgaatgggg tgtatttttc gactgctcgc aggcaccccc aggttatgtg 60

gacagagcta agcccaaagt tgtgattttc cactctgttc tgtccatgtc gagggaagat 120

aagtagaaag tgacacagta agagccagaa tacaccaggt gaaggagaga attgcattgt 180

gttttgagaa gtttcactga caagttatcc tgggctgtgg gacatcacta gctttgaaag 240

tgtagctggc acctcgtcca tctaatttga tgggtgtgtg tggggtgttg tgcacgcgtc 300

ggtctaacat atctgaaccc aggtgatttc tgttctcagg acgcttttag gtgacaagga 360

tcaggcatgt gaacaaataa ccatactgta aagctggctg tgctgggtct c 411



<210> 105 
<211> 413 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (413) 
<223> n = A,T,C or G 



<400> 105

ggcacgagga agattctcgc agtcctggtt tctccctcca gaagaccccc cacccaaatg 60

ctctgtagct cctggtagtg ccctgacccc tgctgccacc gtccttcaga gagcaacgga 120

agagcttccc ggagggcgag gaaaagaggg aaagtagcca gcaatgtcga acgcaatgta 180

taataagatg tggcatcaga cccaagaagc cctcggtgct ttactcgata aagagcctca 240

gaagatgatt gaaccacaaa gaaatcaggt tttcatcttt caaacattag ccaccttcta 300

cgtaaagtat gtgcagatct ttagaaacct agagaatgtc tacgaccagt tcgtccaccc 360

ccagaaacga atactgatca ggaaagtcct ggacggngtg atgggccgca tcc 413



<210> 106 
<211> 412 
<212> DNA 
<213> Homo sapiens 



<400> 106

aggatcccat cgattctaat tcggcacgag ctccataagg cagaggtcta tgcgaggacg 60

cccggctgga ccacgagacc gcccattgat tgcgctggga caagaattcq ttatctttgg 120

aggcagtgaa acgactaata gctaaaggta atacagaaga actacgaaaa tgttttgggg 180

tccgaatgga gtttgtgaca gctggcctcc gagctgctat gggacctgga atttctcgta 240

tgaatgactt gaccatcatc cagactacac agggattttg cagatacctg gaaaaacaat 300

tcagtgactt atagcagaaa ggcatccgga tcagttatga cgcccgagct catccatcca 360

gagggggtag catcaaaagg tttgcccgac ttgctgcaac cacatttatc ag 412

<210> 107 
<211> 408 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature
<222> (1) . . . (408) 
<223> n = A,T,C or G 



<400> 107

ggcacgagga aaaaccagtt tctcttttat tgtctgttac taatctctat tctaaagatt 60

cagctcaatt ctcaaccata ctccaaactc tctcttttcc agctaccttt actccctctc 120

cttcaattcc actttcctct gcttacnnnn nnnnnnncnn nnnnnnnnnn nnnnnnnnnn 180

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnggnn 240

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn naatgttttt 360

tttcattaaa gagagaaatc acctattcag gaccggcccc cacctttg 408



<210> 108 
<211> 405 
<212> DNA 
<213> Homo sapiens 



<400> 108 

ggcacgaggc ttacaggggt gaccagggcc cttcctaact cgaccgcatg tggattggtg 60 

gctggcttgg gagggaggct gtccgatgct gacattcccc ttaacatggc cctgaccgtg 120 

gctgtcaggg gccaccttgc ctcaccaggc cagccccact gggaatgggg tcagtcacag 180 

cagaaccgtc caaaggtgga cctgatgtgg gccctgccgg gggcgcttgg cctcagcggg 240

ccatgggaga cccagtgaaa cgactctagt gtgaggcagt ggtcctgcca ctgactgaca 300 

aaccctcttt gtaagcaaac ttgacaaata atgaatctac tgaactctgt tatagaacaa 360 

gctcattctg catgaacttc tcttattgaa gcagaagcca cgtca 405



<210> 109 
<211> 403 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature
<222> (1) . . . (403) 
<223> n = A,T,C or G 



<400> 109

ggatcccatc gnttcgnatt cggcacgagg caaccagctc gtccagcgcg tggccctgct 60

gctcaaggag cagactgcgt accccccgac acactacatc cggagggtgc cccagaggaa 120

gatccactac ttcacgggcc tgcaggcgct tcagctgctg ctgctgtgtg ccttcggcat 180

gagctccctg ccctacatga agatgatctt tcccctcatc atgatcgcca tgatccccat 240

ccgctatatc ctgctgcccc gaatcattga agccaagtac ttggatgtca tggacgctga 300

gcacaggcct tgactggcag accctgccca cgccccattc gccagccctc cacgtactcc 360

caagctggct ctggaactgt gaggggaagg ggaagatgtg tgg 403



<210> 110
<211> 397
<212> DNA
<213> Homo sapiens

<400> 110

ggcacgagtc tgcttctgtc actgtcacat agacagccct gcatgccccc tgtctcacac 60

aggttgtaat gaagacagct ccttctcatc tttccataag cctgagatac aagttcaggg 120

actcagcaat gcactttagg actgagctag gaggcaaata tctgaagctt gctatgctgt 180

tctttccatt ccttttccct ctgaaacaca caaaatacca aaggaactta cgcaacacac 240 

cactgagtcc tctaactaat catatgtgct cagacacagc tcaagcacac cccttagtta 300 

agaaagaacc tccatataca ttaatttttt tctgcctaaa aataaaattg cgttgtggca 360 

gcaatttgga aactacagca aagtctccaa aaaaatc 397 

<210> 111 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (401)
<223> n = A,T,C or G



<400> 111

ggcacgagag ccgttgcctt caccgccctt tctcctttta tcctttttta aacgctcttg 60

ggggttatgt ccgctgcttc ttgggtgccg agacatatag atggtggtct cgggccagcc 120

cctcctctcc ccgccttctg ggaggaggag gtcacacgct gatgggcact ggagaggcca 180

gaagagactc acaggagcgg gctgccttcc gcctggggct ccctgtgacc tctcagtccc 240

ctggcccggc cagccaccgt ccccagcacc caagcatgca attgcctgtc ccccccggcc 300

agcctcccca acttgatgtt tgcgttttgt ttggggggat atttttcata attatttnnn 360

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 401



<210> 112 
<211> 401 
<212> DNA 
<213> Homo sapiens 



<400> 112

ggcacgaggg cagtccagca acaagccttt catttacatt aaattataac ttttcattca 60

ttcctaaacc aaacttaaaa ttctgctttc ctttgagtag aaggtattta acttgttttg 120

tttttccttc agaaggaatt taatgcaaac ggattgcagt cagcactttc tgaatgtttt 180

cacacagtat gcaaagctta catcatacca aggagtggag agttgaagtt tcctcccagt 240

gactccagtg acagaccaca cctagaaagc gtttctcttc ctgagtattt caaaaagatg 300

taaaagagct ggggagagta tgggaagaaa caatacagga ttgcctttaa ttaattaaga 360

attgcctcct gataaaagga aaaagaaatt aatgctggag g 401



<210> 113 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (401) 
<223> n = A,T,C or G 



<400> 113

ggcacgaggc cccacgggcc ccatctcccc acaggcattg agggtaactg gggtaggctc 60

ctggagcagg tgggcaccat ggctttgtgg gccagccaaa gggaaaagga ggtgcttagg 120

agggaaaggg cagtggaatg gcgggagagg gctgtggaaa aaagggagcg agccctggag 180

gaggtggaaa gggccatcct ggagatgaag tggaaggtga gggctgagaa ggaggcatgc 240

cagcgggaga aagagctgcc tgcagcagta catcccttcc attttgttta aattgggctt 300

ggagaatcta ttctgaaaac attgactcta gacttgtaga anagagccat tttaattttc 360

accttcaatg gtaaaagcaa gggtaatttg gttgacattt t 401



<210> 114 
<211> 399 
<212> DNA 
<213> Homo sapiens 
<400> 114

ggcacgagag cagaagattc tctcagtcct ggtttctccc tccagaagac cccccaccca 60

aatcctctgt agctcctggt agtgccctga cccctgctgc caccgtcctt cagagagcaa 120

cggaagagct tcccggaggg cgaggaaaag agggaaagta gccagcaatg tcgaacgcaa 180

tgtataataa gatgtggcat cagacccaag aagccctcgg tgctttactc gataaagagc 240

ctcagaagat gattgaacca caaagaaatc aggttttcat ctttcaaaca ttagccacct 300

tctacgtaaa gtatgtgcag atctttagaa acctagagaa tggctacgac caggtcgtcc 360

acccccagaa acgaatactg atcaggaaag tcctggacg 399



<210> 115
<211> 399
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1) . . . (399)
<223> n = A,T,C or G



<400> 115

ggcacgaggc tttttccaac ttttaaggat atcaggagag aagacactct tgatgtggag 60

gtttctgcca gtggctacac aaaggaaatg caggcagatg atgaactgct tcatccatta 120

ggtccagatg ataaaaatat tgaaacaaaa gagggatctg aattctcatt ttcagatgga 180

gaagtggcag aaaaagcaga ggtttacagg tcagaaaatg aaagtgaacg gaactgtcta 240

gaagaatcag agggctgcta ttgcagatca tctggagacc ctgaacaaat aaaggaagac 300

agtttatcag aagagagtgc tgatgcacgg agttttgaaa tgactgaatt caatcaagct 360

ttataagaaa taaaagggca ggttgttgaa aacaactcn 399



<210> 116
<211> 400
<212> DNA
<213> Homo sapiens

<400> 116

ggcacgagcg gaccgggccg agccgggccg cccgggcgca gtctttaacc atggcgtccc 60

tcttcaagaa gaaaactgtg gatgatgtaa taaaggaaca gaatcgagag ttacgaggta 120

cacagagggc tataatcaga gatcgagcag ctttagagaa acaagaaaaa cagctggaat 180

tagaaattaa gaaaatggcc aagattggta ataaggaagc ttgcaaagtt ttagccaaac 240

aacttgtgca tctacggaaa cagaagacga gaacttttgc tgtaagttca aaagttactt 300

ctatgtctac acaaacaaaa gtgatgaatt cccaaatgaa gatggctgga gcaatgtcta 360

ccacagcaaa aacaatgcag gcagttaaca agaagatggg 400



<210> 117
<211> 402
<212> DNA
<213> Homo sapiens

<400> 117

ggcacgaggg gagatcgctc agctggccgt gtcctggcag gccacggcat atgcctccaa 60

ggacggggtc ctcactgagg ccatgatgga cgcctgtgtg caagatgctg tccagcagta 120

ccgacagaag atgcgctggc tgaaggcgga ggggcctggg cgcggggtcg agcaccccct 180

atccggagtc caaggcgaga ccctcacctc atggagcctg gccacggacc cctcctaccc 240

ctgccttgcc ggcccctgca catttaggat atgctcctgg atggggactg ggctgtgccc 300

agggcctctg tcccccagga tgtcttgtgg tggcggtcgg ccgttctgcc ccccagggca 360

ccccctgttg taggcactgg ctctaggagg gcaggcctcc tt 402



<210> 118
<211> 395
<212> DNA
<213> Homo sapiens

<400> 118

ggcacgaggt agagatacga atggggtgta gtagccgact gctcgcaggc acccccaggt 60

tatgtggaca gagctaagcc caaagttgtg attttccact ctgttctgtc catgtcgagg 120

gaagataagt agaaagtgac acagtaagag ccagaataca ccaggtgaag gagagaattg 180

cattgtgttt tgagaagttt cactgacaag ttatcctggg ctgtgggaca tcactagctt 240

tgaaagtgta gctggcacct cgtccatcta atttgatggg tgtgtgtggg gtgttgggca 300

cgcgtcggcc tagcagatct gaacccaggt gatttctgtt ctcaggaagc ttttaggtga 360

caaggatcag gcatgtgaac aaataaccat actgg 395



<210> 119
<211> 144
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1) . . . (144)
<223> n = A,T,C or G

<400> 119

ccggtaagga atatacttct tctgatacta aatatgccaa tatttaaaat gtaatattca 60

gggattacaa ctgtgagggc taaacacacg gaattaccca ccaattcctc tgtagttctc 120

tactaattca attttgcatc ctcn 144



<210> 120
<211> 392
<212> DNA
<213> Homo sapiens



<400> 120

ggcacgagac caggtcataa gaggatccgt tccaatgatt ttcctaaaac aatggaagtg 60

ttttccaaag agcttataag gcattgtagg atctggcctg ccctgactcc actttaccag 120

aaccatctgc tgctcttctc tcttgtgtta ctcaaggtat tagctgctgt ggcaaatcaa 180

ctctgaaatc tccgtgactt aatacaagag aggtttattt cttactcacg ctgggtgcac 240

tgccacttgg taacagagga gctatggaaa cttgagacct aagcagaaat gagttcaata 300

atattgctac actctaggac tttctccaaa attaacaaca gaacaaaagt gcaaggcagt 360

gataacccat ctgacagcat ttggggagtg tt 392



<210> 121
<211> 395
<212> DNA
<213> Homo sapiens



<400> 121

ggcacgagat caatcacaaa agtttatcct taagacttcc cttcagctgc tggaaggcag 60

tcatcacatc tgtgaaaaga gtgctagtta taacaaatga gatcacaaat ttgaccattt 120

tattagacac cctctattag tgttaacaga caaagatgaa ggttaagttg aaatcaaatt 180

gaaatcatct tccctctgta cagattgcaa tatctgataa taccctcaac tttcttggtg 240

caaattaatt gcctggtact cacagtccag tgttaacagg caataatggt gtgattccag 300

aggagaggac taggtggcag gaaaataaat gagattagca gtatttgatt ggagccataa 360

gcataatttg gttccggcgg cggccaggtt taaaa 395



<210> 122
<211> 288
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1) . . . (288)
<223> n = A,T,C or G

<400> 122

cgcccgcgcc tctctgttct ctctcgcgcg cggtgtctct ctcgatagag tgcgcgacct 60

gcacaccctc tgtgtggggt tctcgctccc cgtgtgcgcg cgcgcgcgct ctctgtggga 120

ctcgcacaca ccgcgcgcgc gcgcgctctc tgtggggggg ccctccccgc accttgtgtg 180

tgtgtgtctg tgttatctct gtgagatgtg cgtgnnnnnn nnnnntctgt gtgtgtgtct 240

gccctccgcg ccgtgtctgt tatatatgcg ctcgctcgct ggggcgcg 288

<210> 123 
<211> 393 
<212> DNA 
<213> Homo sapiens 



<400> 123

ggcacgagga tccattcttc gacccccaga tgtgactcta aagaaggctg aaaatttttg 60

tccaaattgc catgcagata tcttgaacag caggacattt gcaggccttg tctactggac 120

ttttctccca aacaggacaa gcccaggcag ggctgcatgg agaggaatgg aacctggagc 180

tagaattaat tgcccactct cccaccctac cagtgcagcc cggcaagggc aggaattggg 240

aggcctaagg tgggcatgaa agcttgggaa gcactgtcgt ctctcagaca ggcgtcctaa 300

agacctctag gctggaagct tgggcttgca agtggatccg ggaccgaggg tggtctcttg 360

gacaacccca ggaacttgga ccaaggcaga gcc 393



<210> 124 
<211> 394 
<212> DNA 
<213> Homo sapiens 



<400> 124

ccgcgacgag atgatgatct gcttcttcca ttatgcccag atgataaaaa ggattgatac 60

aaaagaggga tctgaattct cattttcaga tggagaagtg gccgaaaaag cagaggttta 120

caggtcagaa aatgaaagtg aacggaactg cctagaagaa tcagagggct gctattgcag 180

atcatctgga gaccctgaac aaataaagga cgacagttta tcagaagaga gtgctgatgc 240

acggagtttt gaaatgactg aactcaatca agctttagaa gaaataaaag ggcaggctgt 300

tgaaaacacc tctgtaactg aattttctga ggagaaacac cgaacttgaa attcacaccg 360

gcctaatgtc caagaattca a £rgggrgggtc cctc 394



<210> 125 
<211> 390 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (390) 
<223> n = A,T,C or G 



<400> 125

ggcacgagcc cttatacaaa catatatgaa catatatact ttttttgttg tataaaaaca 60

ggatcacatt atagatatta ttctgtaact ttctgttttc acccaaaata cagcagagca 120

ctattttcca gaagcacgta gttctaactt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 180

nnnnnnnnnn nnnnnaactt tattcaagta cttcacattt taagtggaca ttccatttgt 240

ctgctataat ttacaattat agcaatactt tgagaaaggt ctttgcaagt atatccatat 300

gaactaatgt ctatgtagaa gatatgctgg ctcaaatatt atgtacattt aatgtcttaa 360

taaacaccgc tagattactt tccaggaagc 390



<210> 126 
<211> 388 
<212> DNA 
<213> Homo sapiens



<400> 126

ggcacgaggt cagcacacat tactttaaca ctttggactt gaaattctga aagatcagaa 60

attccttact gtttgagatg attaggtttt agggactagc cattttatct cacatgactc 120

aggccttaat gctccattgc taatagctaa atgtggaaaa gtttagaatt acatttaatt 180

tagtcaactg ttaggctgca atcatttttt tttaaaaatc tgcttatggc attattcgag 240

ataacttgac caactctaaa atatatatgt aattacttct agatgtaagt agtttttcat 300

attaacaaca caatcaggct ctgtttcagt tagttcttag agtggtgaaa aaaaatcttt 360

acagtaagtg caaaattata atccaagg 388



<210> 127 
<211> 388 
<212> DNA 
<213> Homo sapiens 



<400> 127

ggcacgagag ttaatccaaa agacttccct tcagctgctg gaaatcagtc atcacatctg 60

tgaaaagagt gctagttata acaaatgaga tcacaaattt gaccatttta ttagacaccc 120

tctattagtg ttaacagaca aagatgaagg ttaagttgaa atcaaattga aatcatcttc 180

cctctgtaca gattgcaata tctgataata ccctcaactt tcttggtgca aattaattgc 240

ctggtactca cagtccagtg ttaacaggca ataatggtgt gattccagag gagaggacta 300

ggtggcagga aaataaatga gattagcagt atttgacttg gagccatagg catcaattct 360

gctccagctg tcgaccaggt tctaaaaa 388



<210> 128
<211> 267
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1) . . . (267)
<223> n = A,T,C or G

<400> 128

actgtgtgtg tgtctgtttt ctctctctct cttctcagtc acactttttt tttgggacac 60

accctccatc cgcggggggg tttttttccc ggcgcgcgcc cttttttttt gtgtgtttct 120

ctgcgcgcct ctcttttttc tctctcttcc ccccccgctt annnnnnnnn nnnnnngcgg 180

ggggggtttt cgcgcgttcn nnnnnnnnnn nnctcttccg cccccccaca ggggggtgct 240

gtttattatc tttctttctc cctgagc 267



<210> 129 
<211> 389 
<212> DNA 
<213> Homo sapiens 



<400> 129

ggcacgagct tgactgcaaa cttgctgaag gtagggactg tttgtcttgg acttcgctgc 60

cagtccttag aacagtgtct gggacacagt gtgttctcaa atatttgttg ctggaataaa 120

tgaatgaact aaatcagtct tttagggatt tactgttaac caccatggga aaattaaata 180

aatgcgggga aggaaaacgt tctaaaatta gaagactact ttctactctc agcttctgat 240

tccctctgag ctaagaacca gacagcctta ggctggtaac tcctataagc tggtcctcct 300

cccatgctga ccccatcttt actgtacaat tcacttttca tggactgaag gcaccaccaa 360

gatagatcca ggagtgacaa ctccagtgg 389



<210> 130
<211> 319
<212> DNA
<213> Homo sapiens



<400> 130

tgttgtaact gggagtggag gcccagtggc tggggagaca ttaggtggtg gggcccagcc 60

cgacctccag gttcttcctt ctccctagct gttgctttgg tctggccact cccagccccc 120

ttgtcccctt ggaagcttgc cctgccctca tcttgcccat gccttctact gccaggagac 180

ttgcacccat ttcaacccta gggcgggggc aagtggggca aggatggacc agcagaaggg 240

gggtaaggct ctgttcactt ccccctgcct ccacagaacg aagccacgga ttccgttatc 300

ttcctccagt tttgttcct 319



<210> 131
<211> 385
<212> DNA
<213> Homo sapiens



<400> 131

ggcacgagaa acgtttcagc tacgaaagtg agctttttcc aacttttaag gatatcagga 60

gagaagacac tcttgatgtg gaggtttctg ccagtggcta cacaaaggaa atgcaggcag 120

atgatgaact gcttcatcca ttaggtccag atgataaaaa tattgaaaca aaagagggat 180

ctgaattctc attttcagat ggagaagtgg cagaaaaagc agaggtttac aggtcagaaa 240

atgaaagtga acggaactgt ctagaagaat cagagggctg ctattgcaga tcatctggag 300

accctgaaca aataaaggaa gacagtttat cagaagagag tgctgatgca cggagttttg 360

aaatgactga attcaatcaa gcttt 385



<210> 132
<211> 383
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1) . . . (383)
<223> n = A,T,C or G



<400> 132

ggcacgaggg gaatagaggg tccctggtga cagggcaagg ctagatctgg agcctgcact 60

tggcctgtga catactgtct tgtttctgag aatcctcccc tacttctcta gataatctcc 120

aaacacttct gtgactactt aatcacaaag gaaattttca ggagatataa tcgaattcta 180

ttttacaaaa aaaaagagaa gggatctgaa tgttttcagt tcacgctagg gatcnnnnnn 240

nnnnnnnnnc ccaaacctga cgtttgagga cccgcctttt tttcagccaa tttaaaagat 300

tttttaaggt ttagggttgg ttggccatta aaccatcccc ggaaagaaaa tgggggtaaa 360

agaccaagaa ggaggtcgcc aag 383



<210> 133
<211> 382
<212> DNA
<213> Homo sapiens



<400> 133

ggcacgagat aagatctgag gtgttacaca cataattgtc ccaattttta agattgatgg 60

ggagcatgaa gcattttttt aatgtgttgg caggccccat taaatgcata aactgcatag 120

gactcatgtg gtctgaatgt attttagggc tttctgggaa ttgtcttgac agagaacctc 180

agctggacaa agcagccttg atctgagtga gctaactgac acaatgaaac tgtcaggcat 240

gtttctgctc ctctctctgg ctcttttctg ctttttaaca ggtgtcttca gtcagggagg 300

acaggttgac tgtggtgagt tccaggacac caaggtctac tgcactcggg aatctaaccc 360

acactggggc cttgaatggc ca 382



<210> 134
<211> 375
<212> DNA
<213> Homo sapiens



<400> 134

ggcacgagca agcctttcat ttacattaaa ttataacttt tcattcattc ctaaaccaaa 60

cttaaaattc tgctttcctt tgagtagaag gtatttaact tgttttgttt ttccttcaga 120

aggaatttaa tgcaaacgga ttgcagtcag cactttctga atgttttcac acagtatgca 180

aagcttacat cataccaagg agtggagagt tgaagtttcc tcccagtgac tccagtgaca 240

gaccacacct agaaagcgtt tctcttcctg agtatttcaa aaagatgtaa aagagctggg 300

gagagtatgg gaagaaacaa tacaggattg cctttaatta attaagaatt gcctcctgat 360

aaaaggaaaa agaaa 375



<210> 135
<211> 376
<212> DNA
<213> Homo sapiens

<400> 135

ggcacgagac ctgtttgagg tggaactcca agcagctcgc accttggagc gactggagct 60

ccagagtctg gaggcagctg agatagagcc ggaggcccag gcccagaggt cgcccaggcc 120

cacgggctca gatctgctcc ctggagcccc catcctcagt ctgcgcttct cctacatctg 180

ccctgaccgg cagttgcgtc gctatttggt gctggagcct gatgcccacg cagctgtcca 240

ggagctgctt gccgtgttga ccccagtcac caatgtggct gttcccctgc aggatctgag 300

tggcatagag ctgggcctgg caggccagag cctgcggcta gagtgggcag ctggggcggg 360

ccgctgtgtg ctgctg 376



<210> 136 
<211> 371 
<212> DNA 
<213> Homo sapiens 



<400> 136

ggcacgaggt cacctctacc agccctcctt tctccagatg gcttcttcat aaccaccagg 60

tcagaagagg atccgttcca atgattttcc taaaacaatg gaagtgtttt ccaaagagct 120

tataaggcat tgtaggatct ggcctgccct gactccactt taccagaacc atctgctgct 180

cttctctctt gtgttactca aggtattagc tgctgtggca aatcaactct gaaatctccg 240

tgacttaata caagagaggt ttatttctta ctcacgctgg gtgcactgcc acttggtaac 300

agaggagcta tggaaacttg agacctaagc agaaatgagt tcaataatat tgctacactc 360

taggactttc t 371



<210> 137
<211> 258
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1) . . . (258)
<223> n = A,T,C or G

<400> 137

cagtttcttt gtgcgcgcgc cccccctttt ttctctctct ctccgcgcgg gcgtgtccct 60

ccnnnnnnnn nnctgtgtgt gcgctctctc cgccccatat atattgtgtt tttctctgtg 120

gannnnnnnn nntctctcta gagtcttttc tctcccctcg cgcgcacatt gttatacact 180

cctcccctct ctttcttttt acacacacat atatattgcg cccctctccc cccacacatt 240

tatatctctc tcacatct 258



<210> 138 
<211> 368 
<212> DNA 
<213> Homo sapiens 



<400> 138 

ggcacgagac attttgagac ttcttccaaa ttggtcccta gaaagttaca ctggtttgta 60 

ctctcactta tgtcactgtt tataccacca ctgactgctg cctgctttat tatttcttta 120 

atgagttgga ctgaacagtg gttaatcctg actctgtttt tgactgacag ttaacagtta 180 

catgaaccat tcatattaca gctcttactt aaatttgacc aagccaggat atatctgtta 240 

ggccacattc atttagggat catgttttcc aaagcaggtt tgggcaaaat taatccacag 300 

gactgaaagg tatacatctg tgagttttgt tctcacttcc acctctaatt tgaagaacac 360 

tttaattg 368 



<210> 139 
<211> 372 
<212> DNA 
<213> Homo sapiens 



<400> 139 

acggcacgag ctggctcctc gttttctttg tggacagtct cattaccaac atcctcgttc 60 

gggtctagga tgcctttctg ctcgagggga ccaacgcggc gattcgctat gccttggcca 120 

ttatcttgta caacgagaag gacatcttga ggctacagaa tggcctggaa atctaccagg 180

acctgcgctt cttcaccaat accaactcca tcagccggaa gctgatgaac attgccttca 240

atgacatgaa ccccttccgc atgaaactat tgcggcagct gtgcatggcc caccgtgagc 300 

ggctggaggc tgatctgccg gagctggagc aacttaaggc aaagtacctg gctaggcagg 360 

catcccggcg ca 372 

<210> 140 
<211> 365 
<212> DNA 
<213> Homo sapiens 



<400> 140

ggcacgaggc tgagagtgct tgatacaccc ttgaatcccc tcttatatga tgccccagcc 60

caggagagat aaaagcatca gcaccatgag attcacctgc ctctggtcgt tagggaacaa 120

tggaggcctg cgatttggag ttaaactctc agtgatctct gtgttgacaa caccaaagct 180

agaggaatcc agtaggatgt gggcatggtt ttcccggaag gctgactgag cagttctgca 240

aatgtttgca agtacagggc agaatttcat ccagcctcag aaccttgagc caagactcag 300

catcagcaaa gccaaaagtt tcatttcttt gactgtggga gtgctagtcc caacctttag 360

atggc 365



<210> 141 
<211> 353 
<212> DNA 
<213> Homo sapiens 



<400> 141 

ggcacgagaa acaaaagaga gcaagagaga agacagtggg tgaagtcctg gttccagact 60 

cccctttttg ccgggatatg atggatctgt cagctggtga ggcccctcta agaggggtgg 120 

tatcttcggg ccaggtgcct agagtcctag agagctagag atggagggaa attcagatca 180 

tctaaaccct tcagcccttc actggacaga agaggaaact gaggctccat ctgcatgacg 240 

ttcccagagt cacggcacaa attcatggaa gaagcagcag gaaactcagt tctccagtct 300 

gggtccaatg tgtgttttag aaatatctcc acagggttaa tgactcaatt ttt 353 



<210> 142 
<211> 352 
<212> DNA 
<213> Homo sapiens 



<400> 142 

ggcacgaggc cactcggggg cccaggaacc cctcagttag ggcttctcag tcactgagcg 60 

gaaggtgccc ccagaggggg cagccgcctg tgaggagcag gcgtgtctgg gtaaccatgt 120 

ggctcctgct ggcctcccct gcctgtcccc aaagcacagg gctcagctcc agagggagac 180 

gggctgggct gtcagtggtc ccaggtgcat cccactttcc agcagcactt ggtgccagca 240 

gaggctgcag gtgtggcagg agggggccca gccgtgaggg caccaggttc aggcccggca 300 

tctcagggtg gagagccagg gctgtcctga acctccagag ggggtgagct gg 352 



<210> 143 
<211> 470 
<212> DNA 
<213> Homo sapiens 



<400> 143 

gacttctgtc tttttaggat cccatcgact tcaattcggc acgaggtcat gagaaaggaa 60 

ccaatggagt atgagaagtt tccagtgaaa aacagaaaga atccagtaga atttatttag 120 

ggaagaggaa aagatgtgtt cggggtggcc ttggaagtga acgttgaagg actactgaga 180 

ttggttcaag aaactgtgaa gggaaagaaa gggttatact gagaaatgga agagataatt 240 

ttagaaactt gcgaaaaatg gcttaatcta aatgagtgtt aggggagata cagctgtgat 300 

gataggttga gctcacatgg tggagagcca cagttgcggg tgcttgcact gataatgtga 360 

gggcatggag acagacaata agttgaatgc tcttttttta acaaaggaag ctaaaaggga 420 

ggggg^tgct aatttgatca atacgtttgg gaaaacttat attttcttgg 470 



<210> 144 
<211> 456 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (456) 
<223> n = A,T,C or G 

<400> 144 

tagcactttt gtttaggagg accccatcga ttcgaattcg gcacgagctg cactgagcag 60 

caccggtgtt cttcatccgg ctgcaccccc aacagagctc tttcttcccc agatcccttt 120 

tacagttgga ttctccctct tggatctggc tctgccttag tccgacctag agggatcagc 180 

ttcgcccacg cccactctca cccggaacct ttcatctctt attgaagcct tttaggccca 240 

ttgggatgtt cattagaact ctgaaaacta cagttctccc ctttatgagg actgcaccac 300 

agctcgccct ctcctgggtt ccgcctggtt gcagagtgag cccatgggac agccctctga 360 

aattatactg cttacaacca tgctgagtct gcaaggactt cgtccaagcc tttccgtcca 420 

ggacctcaaa cagatccaat cacaagaaga gagatn 456 

<210> 145 
<211> 464 
<212> DNA 
<213> Homo sapiens 

<400> 145 

atcgcccata cggcgagccc accgacgcga attcggcacg aggggaaaca caggcctctt 60 

ctgcttttag gaccctcccc ctgccttgca gggggctcgg ggagagcaat atcaggagct 120 

agggcttgct gctgcccaca ctcctgcttt ttgggatatc taactgctaa ggagggagtt 180 

gacatccccc ttctggctca tgtgtctgac accaacaaca tgggctctgt ccctctctct 240 

ttgactctcc ctttgtcctc cccatacagc tggggtgggg tggatcccta tacctggggc 300 

aggcagcccc aaagtggtgg agggggatgg caaagactgt ataggcgcca ctggactctg 360 

gcaaggcctt tattaccttt actccccttc ctctcccatc accagcctca aggcctgagg 420 

tgtgcagggg ctcctggcag ctactgagtg agggttcctg gtcg 464 



<210> 146 
<211> 448 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (448) 
<223> n = A,T,C or G 



<400> 146 

ggcacgagct gcactgagca gcaccggtgt tcttcatccg gctgcacccc caacagagct 60 

ctttcttccc cagatccctt ttacagttgg attctccctc ttggatctgg ctctgcctta 120 

gtccgaccta gagggatcag cttcgcccac gcccactctc acccggaacc tttcatctct 180 

tattgaagcc ttttaggccc attgggatgt tcattagaac tctgaaaact acagttctcc 240 

cctttatgag gactgcacca cagctcgccc tctcctgggt tccgcctggt tgcagagtga 300 

gcccatggga cagccctctg aaattatact gcttacaacc atgctgagtc tgcaaggact 360 

tcgtccaagc ctttccgtcc aggacctcaa acagatccaa tcacaagaag agagatttca 420 

ggaaagagaa nattattcct atcatcgn 448 



<210> 147 
<211> 439 
<212> DNA 
<213> Homo sapiens 



<400> 147 

ggcacgagga aagttaagca actacaggaa atggctttgg gagttccaat atcagtctat 60 

cttttattca acgcaatgac agcactgacc gaagaggcag ccgtgactgt aacacctcca 120 

atcacagccc agcaagctga caacatagaa ggacccatag ccttgaagtt ctcacacctt 180 

tgcctggaag atcataacag ttactgcatc aacggtgctt gtgcattcca ccatgagcta 240 

gagaaagcca tctgcaggtg ttttactggt tatactggag aaaggtgtga gcacttgact 300 

ttaacttcat atgctgtgga ttcttatgaa aaatacattg caattgggat tggtgttgga 360 

ttactattaa gtggttttct tgttattttt tactgctata taagaaagag gtgtctaaaa 420 

ttgaaatcgc cttacaatg 439 

<210> 148 
<211> 334 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (334) 
<223> n = A,T,C or G 



<400> 148 

ccccgcgcgc gctccctctc tatcttttat acaaaatata gagagcgcac atctctgtgt 60 

gtgagagagt ctgtgcgcgc gcgcatatat atatgggagg ggtgtctccc cccatctgtg 120 

tgtctctcct cttgcggggc atatgcgtgc gcacacccgc gcgctgtgtc tcttttgtgc 180 

cnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnncg cgcgcacaca cccacacacc 300 

gtgtgttcta cagcgcgata aagagagaca caca 334 



<210> 149 
<211> 428 
<212> DNA 
<213> Homo sapiens 



<400> 149 

ggcacgaggt cctgagcagc ctcatgggag gtgaattaga gaaaacaaaa gagagcaaga 60 

gagaagacag tgggtgaagt cctggttcca gactcccctt tttgccggga tatgatggat 120 

ctgtcagctg gtgcctagag tcctagagag ctagagatgg agggaaattc agatcatcta 180 

aacccttcag cccttcactg gacagaagag gaaactgagg ctccatctgc atgacgttcc 240 

cagagtcacg gcacaaattc atggaagaag cagcaggaaa ctcagttctc cagtctgggt 300 

ccaatgtgtg ttttagaaat atctccacag ggttaatgac tcaatttttc atgcatgatt 360 

gctagtaatg acaatcatgt tatgtttggt tctgtagctt tggaaatcac tccttccact 420 

tgagtttc 428 



<210> 150 
<211> 427 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (427) 
<223> n = A, T, C or G 



<400> 150 

cgccccaaan nnnaatctct aaaggggtaa gggagatacc taccttgtct ggtaggggag 60 

atgtttcgtt ttcatgcttt accagaaaat ccacttccct gccgacctta gtttcaaagc 120 

ttattcttaa ttagagacaa gaaacctgtt tcaacttgaa gacaccgtat gaggtgaatg 180 

gacagccagc caccacaatg aaagaaatca aaccaggaat aacctatgct gaacccacgc 240 

ctcaatcgtc cccaagtgtt tcctgacacg catctttgct tacagtgcat cacaactgaa 300 

gaatggggtt caacttgacg cttgcaaaat taccaaataa cgagctgcac ggccaagaga 360 

gtcacaattc aggcaacagg agcgacgggc caggaaagaa caccaccctt cacaatgaat 420 

ttgacac 427 



<210> 151 
<211> 437 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> misc_feature 
<222> (1) . . . (437) 
<223> n = A,T,C or G 



<400> 151 

ccgagccgga tgnccttnnn gagtatngca angattccaa ttcggcacga gagacagtgg 60 

catggagctt tgaaagacga gtaggtgtta gcaaggaaat aaggaggaac gggggttacg 120 

ggcagaggag aaagcacatg ccaagtcagc aaagaaaagt agaattcgaa aactttttaa 180 

aaatattact aaggattttc acaatgctgc actgggctag aaactgaagc taaaacagat 240 

acgtggtccc tgctgctatg gggcttacgt tctacaggca aggacaggtt gtgatgaggg 300 

ttctgaagga tagagaccaa gcatggaggg tgttgaggag gcttctgcga gacctgaatg 360 

atgggaagcc acgaagtggg aggggtgggg gtccaggctg gaggggccca atgtatgtgt 420 

agagggacta cagccct 437 



<210> 152 
<211> 425 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (425) 
<223> n = A,T,C or G 



<400> 152 

ggcacgagct gcactgagca gcaccggtgt tcttcatccg gctgcacccc caacagagct 60 

ctttcttccc cagatccctt ttacagttgg attctccctc ttggatctgg ctctgcctta 120 

gtccgaccta gagggatcag cttcgcccac gcccactctc acccggaacc tttcatctct 180 

tattgaagcc ttttaggccc attgggatgt tcattagaac tctgaaaact acagttctcc 240 

cctttatgag gactgcacca cagctcgccc tctcctgggt tccgcctggt tgcagagtga 300 

gcccatggga cagccctctg aaattatact gcttacaacc atgctgagtc tgcaaggact 360 

tcgtccaagc ctttccgtcc agggacctca acagatccaa tcacaagaag agagatttca 420 

ggaan 425 



<210> 153 
<211> 421 
<212> DNA 
<213> Homo sapiens 



<400> 153 

ggcacgagcc gtggctgcct cgtgagcctc ccagagccca ggcctccgtg gcctcctcct 60 

gtgtgagtcc caccaggagc cacgtgcccg gccttgccct caaggatttt tgcttttctc 120 

ctgtgcacct ggcgaggctg aaggcgaggg gtggaggagg ccccagcaca gcctcatctc 180 

catgtgtaca cgtgtgtacg tgtgtatgcg tgtgtgtacg tgtgtatgcg tgtgtgtacg 240 

cgtgtgtacg tgcgtgtgta cacatgcgtg gccgcctgtg gtgtgcacgt gtgctctggg 300 

ctccgaggct tctccagagc tgggagctgg ctggcgtggc aagggcatgc tctggggcag 360 

tgtgtccctc aggaaccagg gtcctccctc ccctttctgc ctggtcagcc ccgtggcctc 420 

t 421 



<210> 154 
<211> 423 
<212> DNA 
<213> Homo sapiens 



<400> 154 

ggcatgagtg gaagggaggc agctgccttt gtttgccatg gatgggtagg ggctgcactg 60 

agcagcagcg gtgttcttca tccggctgca cccccaacag agctctttct tccccagatc 120 

ccttttacag ttggattctc cctcttggat ctggctctgc cttagtccga cctagaggga 180 

tcagcttcgc ccacgcccac tctcacccgg aacctttcat ctcttattga agccttttag 240 

gcccattggg atgttcatta gaactctgaa aactacagtt ctccccttta tgaggactgc 300 

accacagctc gccctctcct gggttccgcc tggttgcaga gtgagcccat gggacagccc 360 

tctgaaatta tactgcttac aaccatgctg agtctgcaag gacttccgcc aagcctttcc 420 

gtc 423 
<210> 155 
<211> 312 
<212> DNA 
<213> Homo sapiens 



<400> 155 

tctgtcactc acaaaacaca gtgcgcgcac atagcggggg gggagcacac acacaagatg 60 

tgtgtgtata caacccgcgc gcgagagagc gctctctttt gtggggggga aaaaaactct 120 

tatacacaca cgtgtgtgtg tgtcgctctc cgaaaataca cactataaca aacgcactgt 180 

gtgtgtgaga cacacactcc tctctccgag tggggagaga gagatcgcgc tccactctta 240 

aacacatatg cgctcacaga gagcatatat atgttttttt tgagagaaga gagagatctc 300 

tttgtggttt ct 312 



<210> 156 
<211> 428 
<212> DNA 
<213> Homo sapiens 



<400> 156 

tgaccttcca ggctacctac gcaggtgtcg gggccaacaa gcacctgcag gagctggccc 60 

aggaggaggt gaagcagcat gcccaggaac tctgggctgc ctacaggggt ctgctgcgag 120 

ttgccttaga gcgcaagggc caggccctgg aggaggatga agacacagag acaagggacc 180 

tccaggtgca tggattggtg ctgcccctca tgctgcccag cttctactca gagctcttca 240 

cgctctacct gctgcttcat gagcgggagg acagcttcta cagccagggc attgccaact 300 

tgagcctctt tcctgatacc caactgctcg agttcctgga tgtgcagaag cacttgtggc 360 

ccctcaagga cctcacgctg acgagcaatc agaggtactc cctggtcagg gacaagtgtt 420 

tcctgtca 428 



<210> 157 
<211> 430 
<212> DNA 
<213> Homo sapiens 



<400> 157 

ggcacgagag gactttgagc ccagagagat gaagtcattt gctcaaggca gcagtcagtg 60 

gaagggcttg gagaaggaga aggggtctga aggtggtgtg ggacacatga gagtgatctc 120 

gcagcttggt ttgctgcagc agactcggac aagcattgtt tcagtgcctg gtttctccct 180 

ccacttgatg ggggccaact ccaacccaat gtcccattcc tatcctgaaa tgcttctaaa 240 

ggcagtgccc tgagaaccac caacctcaca gcctgtctcc attttattgt cttctgggaa 300 

cttctccctt ctgtctagca cctgtttgca ctgggattgt cctgtctgtc cttcagttgg 360 

atcctggttt gcacccgatg aggatttagc aattttaggc tgtgcttcgg caaaggccaa 420 

ctcacaatgg 430 



<210> 158 
<211> 405 
<212> DNA 
<213> Homo sapiens 



<400> 158 

ggcacgaggg aagatttcca gtggtctcaa tggtgtgaat cctatgaagg tgtcttattt 60 

gttgaattag aggtgaaagc ctccttcctc actctttttt agaaacagtt tagttttatt 120 

attatgcaga atttgttgag caaattgcaa cagcccaagc cacagctagc tccacaagag 180 

cccttccatg agccctcaac ctgggatctc gtgtatcttt gttggaatgg acattaggtt 240 

tccaagtcca ggcctgtgat ttagaagggt caggttgggt aggagagagg agagtcttgg 300 

aggggctgct ccatgggggt cacacctctc tcctgtgggt tttcgctggt gattgagttc 360 

tgaggcattt gctgcattga ctgttgtagc tttaactcgt gtgca 405 



<210> 159 
<211> 403 
<212> DNA 
<213> Homo sapiens 



<400> 159 
ggcacgagcc tgactcaagg ggttttggaa gatttccagt ggtctcaatg gtgtgaatcc 60 

tatgaaggtg tcttatttgt tgaattagag gtgaaagcct ccttcctcac tcttttttag 120 

aaacagttta gttttattat tatgcagaat ttgttgagca aattgcaaca gcccaagcca 180 

cagctagctc cacaagagcc cttccatgag ccctcaacct gggatctcgt gtatctttgt 240 

tggaatggac attaggtttc caagtccagg cctgtgattt agaagggtca ggttgggtag 300 

gagagaggag agtcttggag gggctgctcc atgggggtca cacctctctc ctgtgggttt 360 

tcgctggtga ttgagttctg aggcatttgc tgcattgact gtg 403 



<210> 160 
<211> 417 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (417) 
<223> n = A,T,C or G 



<400> 160 

gttctgtggg aatagagggt ccctggtgac agggcagggc tagatctgga gcctgcactt 60 

ggcctgtgac atactgtctt gtttctgaga atcctcccct acttctctag ttaatctcca 120 

gagacttctg tgactactta atcacaaagg aaattttcag gaatattatc aaatactatt 180 

ttagaaaaaa aaagagaagg gatttgaatg ttttcagttc agtttagtta tcnnnnnnnn 240 

nnnnnnnccc caaactccag aatgggggcc cccccttctt taaccccacc taaaaatttt 300 

tcggaggttc agggttggtt ggcaaattac aaaaacccca aaagaaaatg ggggttaacc 360 

cccttggaaa agttttctta ctttgggggg tggccctttg acgtnggccc gggttac 417 



<210> 161 
<211> 300 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (300) 
<223> n = A,T,C or G 



<400> 161 

ctatatctct ctgcgccctc tccccctctt gtgttttccc ccgcccctct agagatatct 60 

ctctcactcg cgggcgcaca ccccccttta caaaataggg ggctctctgt gtgtggtgtt 120 

tttcttgggc gccccctctt tttttttctt tttgcgggcc cccccctgtg tgtctctctc 180 

tagacacacc cccccgcgcg tgttttttat aaatatctgt ctctcacaca ccccctactg 240 

cccctctgtg tgtgggcgcg ttccccccca cacacacaga gtgtgtgnnn nnnnnnnnnn 300 



<210> 162 
<211> 411 
<212> DNA 
<213> Homo sapiens 



<400> 162 

ggcacgaggg caccgagcct cctgtgggag gtcccgaggc agcttcgcct gctcggcctg 60 

gctgcagccc tcacctgccg cagccttagc tgagcagccg ccgccactgg gcgccccccg 120 

ctccccactt cgccagcgcc cgctcctcgg ctcggcccgg ggtagtttgt agggacgcag 180 

ctctccacgt gcgcgactgc gaggctggac gctacgggct cctggaaagg agcagacacc 240 

agcatttgcc acaatgctgt catccactga ctttacattt gcttcctggg agcttgtggt 300 

ccgcgttgac catcccaatg aagagcaggc agaaagacgt ccgcactgag aggattctgg 360 

agacccttca cgttggaagg agtgatgctc aaggttagta gaacagatca 411 



<210> 163 
<211> 412 
<212> DNA 
<213> Homo sapiens 
<400> 163 

gcacgatcca tcattcaatt cggacagcca ctccaactga cctgttccgt ggctgcctcg 60 

agagcctccc atagcccagg cctccgtagg cctcctcctg tgtgagtccc accaggagcc 120 

acgtgcccgg ccttgccctc aagggttttt gcttttctcc tgtgcacctg gctaggctga 180 

aggcgagggg tggaggaggc cccagcacag cctcatctcc atgtgtacac gtgtgtacgt 240 

gtgtatgcgt gtgtgtacgc gtgtacgcgt gtgtgtacgc gtgagtacgt gctgtgtgta 300 

cacatgcgtg gccgcctgtg gtgtgcacgt gtgctctggg ctccgaggct tctccagagc 360 

tgggagctgg ctggcgtggc aagggcatgc tctggggcag tgtgtccctc ag 412 



<210> 164 
<211> 411 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (411) 
<223> n = A,T,C or G 



<400> 164 

ggcacgagag gatatggtgc aaaaaaatat gattttgtta accacaacaa aaagaaaggt 60 

aagaaatgct aggagaaagc taaaagctcc atactaaaat aatggtccta atattaagca 120 

aagtaaaatg tggtatgatt ttgagtggtc agcagagtgt aagaataatc tatttgcact 180 

tgatactttc agctgtcaca gaggtcatag aattgggctt attgagaagg aaaggtaaat 240 

gctagtacac tacttggctc agaagtgaac aaaattgcag tttgnnnnnn nnnnnnnnnn 300 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn n 411 



<210> 165 
<211> 415 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (415) 
<223> n = A,T,C or G 

<400> 165 

ggcacgagag gatatggtgc aaaaaaatat gattttgtta accacaacaa aaagaaaggt 60 

aagaaatgct aggagaaagc taaaagctcc atactaaaat aatggtccta atattaagca 120 

aagtaaaatg tggtatgatt ttgagtggtc agcagagtgt aagaataatc tatttgcact 180 

tgatactttc agctgtcaca gaggtcatag aattgggctt attgagaagg aaaggtaaat 240 

gctagtacac tacttggctc agaagtgaac aaaattgcag tttgnnnnnn nnnnnnnnnn 300 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngtn 415 

<210> 166 
<211> 403 
<212> DNA 
<213> Homo sapiens 

<400> 166 

ggcacgagga aggtgtcagg agcatcccat ttgtgtctct ctctctacct ctgtgaaggg 60 

cgcgaatggg cagagcagaa cttctagaag ggaagatgag cacccaggat ccctcagatc 120 

tgtggagcag atccgatgga gaggctgagc tgctccagga cttggggtgg tatcacggca 180 

acctcacacg ccatgctgct gaagctcttc tcctctcaaa tggatgtgac ggcagctacc 240 

ttctgaggga cagcaatgag accaccgggc tgtactctct ctctgtgagg gccaaagatt 300 

ctgttaaaca ctttcatgtt gaatatactg gatattcatt taaatttggc tttaatgaat 360 

tctcatcttt gaaggatttt gccaagcatt ttgcaaatca gcg 403 

<210> 167 
<211> 407 
<212> DNA 

<213> Homo sapiens 

<400> 167 

ggcacgaggg gcgacaagct gttggagctg caatgggccg cggctgggga ttcttgtttg 60 

gcctcctggg cgccgtgtgg ctgctcagct cgggccacgg agaggagcag cccccggaga 120 

cagcggcaca gaggtgcttc tgccaggtta gtggttactt ggatgattgt acctgtgatg 180 

ttgaaaccat tgatagattt aataactaca ggcttttccc aagactacaa aaacttcttg 240 

aaagtgacta ctttaggtat tacaaggtaa acctgaagag gccgtgtcct ttctggaatg 300 

acatcagcca gtgtggaaga agggactgtg ctgtcaaacc atgtcaatct gatgaagttc 360 

ctgatggaat taaatctgcg agctacaagt attctgaaga agccaat 407 

<210> 168 
<211> 416 
<212> DNA 
<213> Homo sapiens 

<400> 168 

ggcacgagac acaactttga gacaccccaa gtgctttctg cagaggttgt cgttggaaaa 60 

ctgtcacctt acagaagcca attgcaagga ccttgctgct gtgttggttg tcagccggga 120 

gctgacacac ctgtgcttgg ccaagaaccc cattgggaat acaggggtga agtttctgtg 180 

tgagggcttg aggtaccccg agtgtaaact gcagaccttg gtgctttgga actgcgacat 240 

aactagcgat ggctgctgcg atctcacaaa gcttctccaa gaaaaatcaa gcctgttgtg 300 

tttggatctg gggctgaatc acataggagt taagggaatg aagttcctgt gtgaggcttt 360 

gaggaaacca ctgtgcaact tgagatgtct gtggttgtgg ggatgttcca tccctc 416 

<210> 169 
<211> 386 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (386) 
<223> n = A,T,C or G 



<400> 169 

ggcacgagga atctcgcctc tgtctggtgt gttacctact gggggcacag gaacaatttc 60 

ctcaaggaga cagtggcatg gagctttgaa agacgagtag gtgttagcaa ggaaataagg 120 

aggaacgggg gttacgggca gaggagaaag cacatgccaa gtcagcaaag aaaagtagaa 180 

ttcgaaaact ttttaaaaat attactaagg attttcacaa tgctgcactg ggctagaaac 240 

tgaagctaaa acagatacgt ggtccctgct gctatggggc ttccgttcta gaggcaagga 300 

caggttgtga tgagggttct gaaggataga gaccaagcag ggagggtgtt gaggaggctt 360 

ctgcgagacc tgaaggatgg gaagcn 386 



<210> 170 
<211> 391 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (391) 
<223> n = A, T, C or G 



<400> 170 

ggcacgagaa tagagggtcc ctggtgacag ggcagggcta gatctggagc ctgcacttgg 60 

cctgtgacat actgtcttgt ttctgagaat cctcccctac ttctctagtt aatctccaga 120 

gacttctgtg actacttaat cacaaaggaa attttcagga atattatcaa atactatttt 180 

agaaaaaaaa agagaaggga tttgaatgtt ttcagttcag tttagttatc nnnnnnnnnn 240 

nnnnncccaa aactcaagat tggggccccc ccctccttta accccgctaa aaagtttttt 300 

gggggtttag ggtgggttgg caaataacaa aacccccaaa agaaaagggg ggtaaacccc 360 

cttggaaaag tttcctaact ttggggggcg c 391 
<210> 171 
<211> 391 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (391) 
<223> n = A,T,C or G 

<400> 171 

ggcacgagcc tgcatcgacc catttttcct catgacaaac tattggtgca nnnnnnnnnn 60 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnact tagggccact 120 

catctgtcat ggaaccagaa tctaaatcca aataggctgt tgccagtaca gatggtaagt 180 

acatgtactt ctggcaggaa agcagaataa aagttgactg aacctgaaag tctcggaaat 240 

ggtcttctca tttctattct gtaaagtgtc acgtcttcta ggcctacctc tgtcaatatt 300 

gaaatacaaa attaactttt tctgcttttt atttcacaaa tcaacgggaa cagtcttagt 360 

catttgtgtt ttatgagttt taattaggcc n 391 

<210> 172 
<211> 385 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (385) 
<223> n = A,T,C or G 

<400> 172 

ggcacgagga cagtggcatg gagctttgaa agacgagtag gtgttagcaa ggaaataagg 60 

aggaacgggg gttacgggca gaggagaaag cacatgccaa gtcagcaaag aaaagtagaa 120 

ttcgaaaact ttttaaaaat attactaagg attttcacaa tgctgcactg ggctagaaac 180 

tgaagctaaa acagatacgt ggtccctgct gctatggggc ttccgttcta gaggcaagga 240 

caggttgtga tgagggttct gaaggataga gaccaagcag ggagggtgtt gaggaggctt 300 

ctgcgagacc tgaaggatgg gaagccagga agtgggaggg gtgggggtnc aggctggagg 360 

ggcccaatgt angtgtaaag ggact 385 



<210> 173 
<211> 392 
<212> DNA 
<213> Homo sapiens 



<400> 173 

ggcacgagaa aggctggaag ggaggcagct gcctttgttt gccatggatg ggtaggggct 60 

gcactgagca gcaccggtgt tcttcatccg gctgcacccc cgacagagct ctttcttccc 120 

cagatccctt ttacagttgg attctccctc ttggatctgg ctctgcctta gtccgaccta 180 

gagggatcag cttcgcccac gcccactctc acccggaacc tttcatctct tattgaagcc 240 

ttttaggccc attgggatgt tcattagaac tctgaaaact acagttctcc cctttatgag 300 

gactgcacca cagctcgccc tctcctgggt tccgcctggt tgcagagtga gcccatggga 360 

cagccctctg aaattatact gcttacaacc at 392 



<210> 174 
<211> 394 
<212> DNA 
<213> Homo sapiens 



<400> 174 

ggcacgagat ggaatgacag ctttttttag tagcatatcc ttgcgctgtg ttagatggag 60 

tctttgccct gatttccgtc ttttgaaaat ttatctggga tgtggacatc agtgggccag 120 

atgtacaaaa aggaccttga actcttaaat tggaccagca aactgctgca gcgcaactct 180 

catgcagatt tacatttgac tgttggagca atgaaagtaa acgtgtatct cttgttcatt 240 

tttatagaac ttttgcatac tatattggat ttacctgcgg tgtgactagc tttaaatgtt 300 

tgtgtttata cagataagaa atgctatttc tttctgggtc ctgcagccat tggaaaaact 360 

tttttctttg gaaataataa ggtttttgat agat 394 

<210> 175 
<211> 387 
<212> DNA 
<213> Homo sapiens 



<400> 175 

ggcacgaggg cagttagggc tgccatgtgc tgggagctgt gtgtctgctc tccttcgtcc 60 

gctcccccag ggcagtgtgg tagcacatcc cattgtagag atgagggcac cgaggcttcc 120 

tggagcatac cacctggtcc cgttcatgag tggtggcaaa gctagcactc tcacttgtcc 180 

attctgcctt cctggagacc agtgggatgg gtcagtacag cccaccacac cattagcccc 240 

aggaacataa ggctgtggct agacagcagg ggtctcaggt tcatacatga ggactggctt 300 

gtccttgagc acccactcac ctgtctatgt ggggaggaat cctacaatag gtcaccatgg 360 

caggctgggg cttgctgacc ctgcccc 387 



<210> 176 
<211> 395 
<212> DNA 
<213> Homo sapiens 



<400> 176 

ggcacgagca gacctccatt acctccatcc ctgttggatt atttaaagaa agcctcagac 60 

agtaagggct ttttttaaaa gaataaaatg acttggtttg cgcttggaag caggggaagc 120 

attcagatga gcggtttctg cattaaccct gcctatcacg catctcgtgt cctgtgtggc 180 

tggcgagccc cccttggaag gttctggtgc ttcagctggc tcctgcagag tccaccccgc 240 

ctcgtggtgg gaatgcagag ccctttgctt tccttcttgc cgcctgcttc ctgttcctgg 300 

ggacccgctg ggcctttggt ctgcatcccc tggccaggtc cctcagggct gatgcgcgta 360 

gaaggacttt gagcagtggt ggcagcactt gccct 395 



<210> 177 
<211> 388 
<212> DNA 
<213> Homo sapiens 



<400> 177 

ggcacgaggg acgctgcgga gcccgctcac ccgctccctg tacgtgaaca tgactagcgg 60 

cccgggtggg ccggcggcgg ccgcgggcgg caggaaggag aaccaccagt ggtatgtgtg 120 

caacagagag aaattatgcg aatcactcca ggctgtcttt gttcagagtt accttgatca 180 

aggaacacag atcttcttaa acaacagcat tgagaaatcg ggctggctat ttatccaatt 240 

atatcattct tttgtgtcat ctgtttttag cctgtttatg tctagaacat ctatcaatgg 300 

gttgctagga agaggctcaa tgtttgtgtt ttcaccagat cagtttcaga gactgcttaa 360 

aattaatcca gactggaaaa cccacaga 388 



<210> 178 
<211> 397 
<212> DNA 
<213> Homo sapiens 



<400> 178 

ggcacgagca ggatccctca gatctgtgga gcagatccga tggagaggct gagctgctcc 60 

aggacttggg gtggtatcac ggcaacctca cacgccatgc tgctgaagct cttctcctct 120 

caaatggatg tgacggcagc taccttctga gggacagcaa tgagaccacc gggctgtact 180 

ctctctctgt gagggccaaa gattctgtta aacactttca tgttgaatat actggatatt 240 

catttaaatt tggctttaat gaattctcat ctttgaagga ttttgtcaag cattttgcaa 300 

atcagccttt gattggaagc gagacaggca ctctgatggt tctaaaacat ccctacccaa 360 

gaaaagtgga agaaccctcc atttatgaat ctgtccg 397 



<210> 179 
<211> 397 
<212> DNA 
<213> Homo sapiens 

<400> 179 

ggcacgaggc gtggggcgac aagctgccgg agctgcaatg ggccgcggct ggggattctt 60 

gtttggcctc ctgggcgccg tgtggctgct cagctcgggc cacggagagg agcagccccc 120 

ggagacagcg gcacagaggt gcttctgcca ggttagtggt tacttggatg attgtacctg 180 

tgatgttgaa accattgata gatttaataa ctacaggctt ttcccaagac tacaaaaact 240 

tcttgaaagt gactacttta ggtattacaa ggtaaacctg aagaggccgt gtcctttctg 300 

gaatgacatc agccagtgtg gaagaaggga ctgtgctgtc aaaccatgtc aatctgatga 360 

agttcctgat ggaattaaat ctgcgagcta caagtat 397 

<210> 180 
<211> 399 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (399) 
<223> n = A,T,C or G 

<400> 180 

ggcacgaggt cacccctttt gcctccatcc tcaaagacct ggtcttcaag tcatccgtca 60 

gctgccaagt gttctgtaag aagatctact tcatctgggt gacgcggacc cagcgtcagt 120 

ttgagtggct ggctgacatc atccgagagg tggaggagaa tgaccaccag gacctggtgt 180 

ctgtgcacat ctacatcacc cagctggctg agaagttcga cctcaggacc actatgctgt 240 

acatctgtga gcggcacttc cagaaggttc tgaaccggag tctattcaca ggcctgcgct 300 

ccatcaccca ctttggccgt cccccctttg agcccttctt caactccctg caggaggtcc 360 

acccccacgt ccggaagatc ggagtgttta gctgtggcn 399 

<210> 181 
<211> 402 
<212> DNA 
<213> Homo sapiens 

<400> 181 

ggcacgaggc tacttcgctc gcaaggatta gtactgcaag gagtatgtgg cagctgtcct 60 

ggagcatatc gagaacaaga acctcatgcc acctcttcta gtggtgcaga ccctggccca 120 

catctccaca gccacactct gcgtcatcag ggactacctg gtccaaaaac tacagaaaca 180 

gagccagcag attgcacagg atgagctgcg ggtgcggcgg taccgagagg agaccacccg 240 

tatccgccag gagatccaag agctcaaggc cagtcctaag attttccaaa agaccaagtg 300 

cagcatctgt aacagtgcct tggagatgcc ctcagtccac ttcctgtgtg gccactcctt 360 

ccaccaacac tgctttgaga gttactcgga aagcgaagct ga 402 

<210> 182 
<211> 384 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (384) 
<223> n = A, T, C or G 

<400> 182 

ggcacgagag caactcaggc ctgctgggtt aactgcttac accattttcc ttcccctcct 60 

cttccttgcc ttcgacactc ttaacctgga aaaagcacta atttgtcctc catatctgtg 120 

gttttgtcat ttggaaaggt tgtagaaatc ctagagtatg tgacctttta agatgcactt 180 

tttagaaaac tcaacatgtt gctcttgtgt taatagtttg ttctttttag tgttcggtat 240 

tctcttgtgt ggtcatgccc cagtttattt aaccatccca tagatgttta ttttcccttg 300 

taaagttggt tagcatgtan nnnnnnnnnn nnnnnnggga aactcattct cnnnnnnnnn 360 

nnnnnnnnnn nnnnntgccc cttg 384 

<210> 183 
<211> 384 
<212> DNA 

<213> Homo sapiens 



<400> 183 

ggcacgaggg aaggtgaggg ctgagaagga ggcatgccag cgggagaaag agctgcctgc 60 

agcagtacat cccttccatt ttgtttaaat tgggcttgga gaatctattc tgaaaacatt 120 

gactctagac ttgtagaaaa gagccatttt agtttcaact caaatgtaaa gcaaggtagt 180 

ttggtgacat tttgctttta tgtgaaatag tgcacagtat gagttaatct gagcaggtct 240 

gaattgacca aatgcttatc tacgaggttc ctagagctct gctgaccctt ggccgaaact 300 

ctaaaatgta cctattaaag ataaatgctt ctaccaaagt aaaactctgt gagttgtttc 360 

agggcagaat gtaccagcca gtca 384 



<210> 184 
<211> 379 
<212> DNA 
<213> Homo sapiens 



<400> 184 

ggcacgagct tcctccagcc tccacagcct tggctcagtg tccctgtgta caagacccag 60 

tgacttccag gctcccagaa accccaccct aaccatgggc caacccagaa caccccactc 120 

tccaccactg gccaaagaac atgccagcag ctgcccccca tccatcacca actccatggt 180 

ggacataccc attgtgctga tcaacggctg cccagaacca gggtcttctc caccccagcg 240 

gaccccagga caccagaact ccgttcaacc tggagctgct tctcccagca acccctgtcc 300 

agccaccagg agcaacagcc agaccctgtc agatgccccc tttaccacat gcccagaggg 360 

tacgtcgtaa accaatatt 379 



<210> 185 
<211> 368 
<212> DNA 
<213> Homo sapiens 



<400> 185 

ggcacgagac ccggtccagg tgccctacgt cggcgcgagc gcgcggcagg tggagcacgt 60 

gttgtcgctg ctgcgaggac gccccggaaa aacggtggat ctgggctctg gcgacggcag 120 

gatcgtgctg gcggcccaca ggtgcggcct ccgcccggcc gtgggctacg agctgaaccc 180 

ctggctggtg gcgctggcgc ggctgcacgc ctggagggcc ggctgtgccg gcagcgtctg 240 

ctatcgccgc aaggatctct ggaaggtaac ctggggatcc ctggccaccc gctgacagcc 300 

caaggtgcgg ctgacacctg cgagggctgg gggccgggac tcggaagctg cgatgacccg 360 

gtgcccac 368 



<210> 186 
<211> 375 
<212> DNA 
<213> Homo sapiens 



<400> 186 

ggcacgaggt ctcacagagc gagaaggtgt caggagcagc ccatttgtgt ctctctctct 60 

acctctgtga agggcgcgaa tgggcagagc agaacttcta gaagggaaga tgagcaccca 120 

ggatccctca gatctgtgga gcagatccga tggagaggct gagctgctcc aggacttggg 180 

gtggtatcac ggcaacctca cacgccatgc tgctgaagct cttctcctct caaatggatg 240 

tgacggcagc taccttctga gggacagcaa tgagaccacc gggctgtact ctctctctgt 300 

gagggccaaa gattctgtta aacactttca tgttgaatat actggatatt catttaaatt 360 

tggctgtaat gaatt 375 



<210> 187 
<211> 368 
<212> DNA 
<213> Homo sapiens 

<400> 187 

ggcacgaggc cgtgcagagc ctgtatggta agcccctagg gggctcaaag gccggccagc 60 

tcccaggaaa gatgtgcact gactttgaaa cctgggactc ctacagcccc caaggaaggc 120 

gccctgaaac gcagggccct aaatactgcc actcttcctt cgatgccatc actgtagaca 180 

ggcaacagca actgtacatt tttaaaggga gccatttctg ggaggcggca gctgatggca 240 

acgactcaga gccccgtcca ctgcaggaaa gatgggtcgg gctgcccccc aacattgagg 300 

ctgcggcagc gtcattgaat gatggagatt tctacttctt caaagggggg cgatgctgga 360 

ggatccgg 368 

<210> 188 
<211> 436 
<212> DNA 
<213> Homo sapiens 



<400> 188 

ggcacgagaa ggggctgggg tgggctcagg caaggcctgg ggccctggcc ttcttcctgg 60 

cagggggagg caggggactg tgcaggggct cagggaggcc tcccccacct gccccctgac 120 

cacacccact ctgatgaggc tcatggcctc ctggcaggtc gacggaggag atcatcgccc 180 

tcttcatttc catcacgttt gtgctggatg ccgtcaaggg cacggttaaa atcttctgga 240 

agtactacta tgggcattac ttggacgact atcacacaaa aaggacttca tcccttgtca 300 

gcctgtcagg cctcggcgcc agcctcaacg ccagcctcca cactgccctc aatgccagct 360 

tcctcgccag ccccacggag ctgccctcgg ccacacactc aggccaggcg accgccgtgc 420 

tcagcctcct catcat 436 



<210> 189 
<211> 435 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (435) 
<223> n = A,T,C or G 



<400> 189 

ggcacgagac agaccctttc ttcctaaagg ctttgtggca tcagacacat aaagggtata 60 

tgtagtgtgg agcactaacc atggcagggt aatttattcc aggcacagag tcataattct 120 

ggaaacatct agactcactg cattaacaga gcattttgtt tctaaagtag acctcttatg 180 

tcatccagat ttcactcatt ctgaccacag ccaggaagct gagggtgaag ccagaattag 240 

ctgaaaccca ccaagagctg catagagcac gtttagctag agtaggagtt tgcagtgctc 300 

atatgggaaa tgctgctgct atacttttag gaatttctga gtgcaattta gaaacatcta 360 

gcacacttga aacactgcgt atcattntcc tcactcatga atatagtcat cagaattcat 420 

aaatagttta cctga 435 



<210> 190 
<211> 437 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> (1)...(437)
<223> n = A,T,C or G 



<400> 190 

ggcacgagat taggaccctt ccttggcaca ggggtgagaa agagcttggg gaacgcttgg 60 

cattatggag ggctggaagg ggctcaaccc cgatttggag agaagtttgg gatggagtgg 120

gcgagagatt gagagagcga gcaggaaaag aggtcttgga gcctgggact gatggtggat 180

aaggcctgga aagaagatga cgaggaggag gagagaggga agtggggtgg atgaggagca 240

ggctgacacc tgggctgccc tcaatcccca aggccaggga gggcggngct ggcccctggg 300

aagaactggg tctctgggct ccctatgcac tgcccaaact ggctgagcca ggagtggggc 360

aggaagtgag agtcaaggcc cagcaaaagg agggggagga gctgccaatt ataaccttgt 420 

gganggaccg gtttgng 437 



<210> 191 
<211> 434 
<212> DNA 
<213> Homo sapiens



<400> 191 

ggcacgagaa gaaactgtga agggaaagaa aggtttatac tgagaaatgg aagagataat 60

tttagaaact tgtgaaaaat ggcttaatct aaatgagtgt taggggagat acagttgtga 120

tgataggttg agctcacatg gtggagagcc acagttgcgg gtgcttgcac tgataatgtg 180

agggcatgga gacagacaat aggttgaatg ctcttttttt acaaaaggaa gtagaaaggg 240

agggggatgt aaatttgata aataggttgg tgaaaactta tattttcttg taaagagaga 300

gaactgagca tgttgtaggt ataaggtaaa aaggcgtgaa gaggaatatt tcgttgataa 360

tgaaagtgag cagctaggga agaaaactcc cagaggaaga gggaggcaag gaaatcaaga 420

acacacttaa agtg 434

<210> 192
<211> 323
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1)...(323)
<223> n = A,T,C or G

<400> 192

gggtctctcg cccccctctc tctcttttgt gtgtctctct ctctgtcccg tgtgtgnnnn 60

nnnnnnnnnt ctctctatat ctcgcgcgcg cgcactcccg tgtgtgtgtg tgaccccgcc 120

ccctcatgcg ctctctcatt tgtggagaga gagaccgcta tctatctctc tctcccccgc 180

cctatacaca tctccctctc tgtgaaagag acgtgtgtgt gtctccacac cccttgggcg 240

cgcgcgcgcc accccctctc ctgggggggg tgtcctctct gtatatatat atgtgcacac 300

acgcgcgcgc gctctgtgtt gtt 323

<210> 193
<211> 412
<212> DNA
<213> Homo sapiens

<400> 193

ggcacgagaa ggggccgtga cagccgttgc catctgctgc cggagccggc acctggcgca 60

ggcctcccag gagctccagt gacagcccca tcccaggatg ggtgtctggg gagggtcaag 120

ggctggggct gagctttaaa atggttccga cttgtccctc tctcagccct ccatggcctg 180

gcacgagggg atggggatgc ttccgccttt ccggggctgc tggcctggcc cttgagtggg 240

gcagcctcct tgcctggaac tcactcactc tgggtgcctc ctccccaggt ggaggtgcca 300

ggaagctccc tccctcactg tggggcattt caccattcaa acaggtcgag ctgtgctcgg 360

gtgctgccag ctgctcccaa tgtgccgatg tccgtgggca gaatgacttt ta 412

<210> 194
<211> 405
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1)...(405)
<223> n = A,T,C or G



<400> 194 

cgttgctgtc ggtcagcaat gaaataaata tcttgtagaa tgttcnnnnn nnnnnnnnnn 60

nngaaccctc gggggccctt ttttcccgaa acccccactg gaaaaaaacc cttggggggt 120

tggcaaaacc ccccaataaa agggggggaa aaaaaggctt tttttggaaa aatggggggg 180

tctttgcttt ttttggaccc ctttaaagcg gggaaaacca ggttaacccc ccccaggggc 240

nnnnnnnnnn gtttcagggc cnnnnnnnnn nnnnnnnnnt tttttccctn tctcccttct 300

gtctcgccct gctgcgctgc cgttttctcg ttccactccc cccgtttttg tactcccccc 360

gtgccgttga gcgtccaccc tattctttcg cgccggtgca ccccc 405
<210> 195
<211> 400 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(400)
<223> n = A,T,C or G 

<400> 195 

ggcacgagat taggaccctt ccttggcaca ggggtgagaa agagcttggg gaacgcttgg 60 

cattatggag ggctggaagg ggctcaaccc cgatttggag agaagtttgg gatggagtgg 120 

gcgagagatt gagagagcga gcaggaaaag aggtcttgga gcctgggact gatggtggat 180 

aaggccttga aagaagatga cgaggaggag gagagaggga agtggggtgg atgaggagca 240 

ngctgacacc tgggctgccc tcaatcccca aggccaggga gggcggngct ggcccctggg 300

aagaactggg tctctgggct ccctaggcac tgcccaaact ggctgagcca ggagtggggc 360 

aagaaatgag agttcaggcc caacacaagg agggggaggg 400 

<210> 196 
<211> 402 
<212> DNA 
<213> Homo sapiens 

<400> 196 

ggcacgagat taggaccctt ccttggctca ggggtgagaa agagcttggg gaacgcttgg 60 

cattatggag ggctggaagg ggctcaaccc cgatttggag agaagtttgg gatggagtgg 120 

gcgagagatt gatagagcga gcaggaaaag aggtcttgga gcctgggact gatggtggat 180

aaggcctgga aagaagatac taggaggagg agagagggaa gtggggtgga tgaggagcag 240 

gctgacacct gggctgccct caatccccaa ggccagggag ggcggggctg gcccctggga 300 

agaactgggt ctctgggctc cctaggcact gcccaaactg gctgaaccag gagtggggca 360 

agaagtgaga gtcaaggccc aacaaaagga gggggaggag ct 402 

<210> 197 
<211> 401 
<212> DNA 
<213> Homo sapiens 



<400> 197 

ggcacgagct ctcagcggcc ggtttctgcg tccgctgccg caggttccac cgcgctccag 60 

gtattttttt ttctgaagga aagctgcttc ctcatatgtt tcaagaatgg ctctccctat 120

cattgtaaaa tggggtggac aggagtattc agtgaccaca ctttcagaag atgatactgt 180

gctcgatctc aaacagtttc tcaagaccct tacaggagtt cttccagaac gccaaaagtt 240

acttggactc aaagttaaag gcaaacctgc agaaaatgat gttaagcttg gagctctcaa 300

actgaaacca aatactaaaa tcatgatgat gggaactcgt gaggagagct tggaagatgt 360

cttaggtcca ccccctgaca atgatgatgt tgttaatgac t 401



<210> 198 
<211> 397 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(397)
<223> n = A,T,C or G 



<400> 198 

tgcatattag acattcttaa cagggcggca gtctagtgtt gaaagtttta tttttccatt 60 

tttcttttaa gcaaattttt tttaaaaaat tctgattnnn nnnnnnnnnn nnnnnnnnnn 120

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 180

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn tctgatttaa 240

ttcttttatt tatcataagg ggtttaattc ctgaagtaaa ggtttgcacc tattaaactt 300

aaaactgcca aatgattttt gttcttttat gtgcgcgata gaaatacaaa gaatggagtg 360
gccacctcct ccctttcaag ctagggcagc agggacg 397 

<210> 199 
<211> 398 
<212> DNA 
<213> Homo sapiens 



<400> 199 

ggcacgagaa gaaaggttta tactgagaaa tggaagagat aattttagaa acttgtgaaa 60

aatggcttaa tctaaatgag tgttagggga gatacagttg tgatgatagg ttgagctcac 120

atggtggaga gccacagttg cgggtgcttg cactgataat gtgagggcat ggagacagac 180

aataggttga atgctctttt tttacaaaag gaagtagaaa gggaggggga tgtaaatttg 240

ataaataggt tggtgaaaac ttatattttc ttgtaaagag agagaactga gcatgttgta 300

ggtataaggt aaaaaggcgt gaagaggaat atttcgttga taatgaaagg gagcaactta 360

gggaaaaaaa cttcccaagg aggaggggag cagggaaa 398



<210> 200 
<211> 394 
<212> DNA 
<213> Homo sapiens



<400> 200 

ggcacgagca gaaggcagcg gtctaggcga ggacgcccgg ctggaccagg agaccgccca 60

gtggctgcgc tgggacaaga attccttaac tttggaggca gtgaaacgac taatagcaga 120

aggtaataaa gaagaactac gaaaatgttt tggggcccga atggagtttg ggacagctgg 180

cctccgagct gctatgggac ctggaatttc tcgtatgaat gacttgacca tcatccagac 240

tacacaggga ttttgcagat acctggaaaa acaattcagt gacttaaagc agaaaggcat 300

cgtgatcagt tttgacgccc gagctcatcc atccagtggg ggtagcagca gaaggtttgc 360

ccgacttgct gcaaccacat ttatcagtca gggg 394



<210> 201 
<211> 391 
<212> DNA 
<213> Homo sapiens



<400> 201 

ggcacgagca ggcgtgtctg ggtaaccatg tggctcctgc tggcctcccc tgcctgtccc 60

caaagcacag ggctcagctc cagagggaga cgggctgggc tgtcagtggt cccaggtgca 120

tcccactttc cagcagcact tggtgccagc agaggctgca ggtgtggcag gagggggccc 180

agccgtgagg gcaccaggtt caggcccggc atctcagggt ggagagccag ggctgtcctg 240

aacctccaga gggggtgagc tgggaacttg tgtgaagggg ctttttccaa aaggaaaacg 300

ggagcttact ggctcacggc tgatgcccca gacagcctcg aggatctgca ggtccccaga 360

caccaagcct gggtgctctc cagcagacgg 391



<210> 202 

<211> 392 

<212> DNA 

<213> Homo sapiens



<400> 202 

ggcacgagat tctcagtaca ctaaacactt gttaagagtg ttgttaagag ccagagtgag 60

tatcatgtgg gacacagacc ctttcttcct aaaggctttg tggcatcaga cacataaagg 120

gtatatgtag tgtggagcac taaccatggc agggtaattt attccaggca cagagtcata 180

attctggaaa catctagact cactgcatta acagagcatt ttgtttctaa agtagacctc 240

ttatgtcatc cagatttcac tcattctgac cacagccagg aagctgaggg tgaagccaga 300

attagctgaa acccaccaag agctgcatag agcacgttta gctagagtag gagtttgcag 360

tgctcatatg ggaaatgctg ctgctatact tt 392



<210> 203 
<211> 392 
<212> DNA 
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1)...(392)
<223> n = A,T,C or G 



<400> 203 

ggcacgagga ggagcccgcc ccggaggctg aggctctggc cgcagcccgg gagcggagca 60

gccgcttctt gagcggcctg gagctggtga agcagggtgc cgaggcgcgc gtgttccgtg 120

gccgcttcca gggccgcgcg gcggtgatca agcaccgctt ccccaagggc taccggcacc 180

cggcgctgga ggcgcggctt ggcagacggc ggacggtgca ggaggcccgg gcgctcctcc 240

gctgtcgccg cgctggaata tctgccccag ttgtcttttt tgtggactat gcttccaact 300

gcttatatat ggaagaaatt gaaggctcag tgactgttcg agattatatt cagtccacta 360

tggagactga aaaaactccc cagggtctct cn 392



<210> 204 
<211> 386 
<212> DNA 
<213> Homo sapiens 



<400> 204 

ggcacgagaa gccttaaacc gggaaatttc catgctatct agaggttttt gatgtcatct 60

taagaaacac acttaagagc atcagattta ctgattgcat tttatgcttt aagtacgaaa 120

gggtttgtgc caatattcac tacgtattat gcagtattta tatcttttgt atgtaaaact 180

ttaactgatt tctgtcattc atcaatgagt agaagtaaat acattatagt tgattttgct 240

aaatcttaat ttaaaagcct cattttccta gaaatctaat tattcagtta ttcatgacaa 300

tattttttta aaagtaagaa attctgagtt gtcttcttgg agctgtaggt cttgaagcag 360

caacgtcttt caggggttgg agacag 386



<210> 205 
<211> 295 
<212> DNA 
<213> Homo sapiens 



<400> 205 

gcgctctctt cacacacaaa agatatatat atagaaaggg agtgtggata tcccccctaa 60

atatgtgagc gtgtctctct cgaccgtctc ccccagagaa aatatctcta gagagagcac 120

aagtgtgttc tctgtgtctt gtgtgtgaga aaaaataagt gcccgcgcac acatagattt 180

ttatatcgct cccccccgcg cctttatata tgtttttggt gtgtatatat attttataca 240

aaaacatgtt tctttttgag gccccttaca acaaaaattt tgttcttttt gaacc 295



<210> 206 
<211> 383 
<212> DNA 
<213> Homo sapiens 



<400> 206 

ggcacgaggt tacccatcag cccttgcaag tcccccactc aggcctctgg aaggtccagg 60

gatgggctct gatgagaggg taaaagatgc tcagggaaac acaggcctca gctgcctaga 120

ggaccctccc cctgccttgc agtgggctcg ggtagagcag tatcaggagc tagggttgtc 180

tgctgcccac actcctgctt tttgggatat ctaactgcta aggagggagt tgacatcccc 240

cttctggctc atgtgtctga caccaacaac atggtctctg tccctctctc tttgactctc 300

cctttgtcct ccccatagag ctggggtggg gtggatccct atacctgggg caggcagccc 360

caaagtgggg gagggggatg gca 383



<210> 207 
<211> 385 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature
<222> (1)...(385)
<223> n = A,T,C or G

<400> 207

ggcacgagct tcaggataag aagctcatgg ccatgttcct agagtataac aaagccatcc 60

ggaactacac ccgcttcgat gactggtacc tgtgggttca gatgtacaag gggactgtgt 120

ccatgccagt cttccagtcc ttggaggcct actggcctgg tcttcagagc ctcattggag 180

acattgacaa tgccatgagg accttcctca actactacac tgtatggaag cagtttgggg 240

ggctcccgga attctacaac attcctcagg gatacacagt ggagaagcga gagggctacc 300

cacttcggcc agaacttatt gaaagcgcaa tgtacctcta ccgtgccacg gnggatccca 360

ccctcctaga actcggaaga gatgg 385

<210> 208
<211> 374
<212> DNA
<213> Homo sapiens

<400> 208

ggcacgagcc tcagctgcct agaggaccct ccccctgcct tgcagtgggc tcgggtagag 60

cagtatcagg agctagggtt gtctgctgcc cacactcctg ctttttggga tatctaactg 120

ctaaggaggg agttgacatc ccccttctgg ctcatgtgtc tgacaccaac aacatggtct 180

ctgtccctct ctctttgact ctccctttgt cctccccata gagctggggt ggggtggatc 240

cctatacctg gggcaggcag ccccaaagtg ggggaggggg atggcagaga ctgtaaaggc 300

gccactggac tctggcaagg cctttattac ctttactccc ctccctctcc catcaccagc 360

ctcaaggcct gagg 374



<210> 209 
<211> 425 
<212> DNA 
<213> Homo sapiens 



<400> 209 

ggcacgagcc caagtgcttt ctgcagaggt tgtcgttgga aaactgtcac cttacagaag 60

ccaattgcaa ggaccttgct gctgtgttgg ttgtcagccg ggagctgaca cacctgtgct 120

tggccaagaa ccccattggg aatacagggg tgaagtttct gtgtgagggc ttgaggtacc 180

ccgagtgtaa actgcagacc ttggtgcttt ggaactgcga cataactagc gatggctgct 240

gcgatctcac aaagcttctc caagaaaaat caagcctgtt gtgtttggat ctggggctga 300

atcacatagg agttaaggga atgaagttcc tgtgtgaggc tttgaggaaa ccactgtgca 360

acttgagatg tctgtggttg tggggatgtt ccatccctcc gttcagttgt gaagacctct 420

gctct 425

<210> 210
<211> 396
<212> DNA
<213> Homo sapiens

<400> 210

ggcacgagga gcaaggaagt aatattgtca tatttgcagt tgagaatgat ccctgagtct 60

cggttttctt atctatgaaa tgaggctaag aataataaaa tagagaatta aatgagataa 120

tgcctgtaaa cagtgcctgg catatagctt attattcatc cagctaagag gcccttccat 180

atgtgaagct ttgctctgtg aggtctgtat tacaatcaca ttcagttata gctaattatt 240

tacttatgta gctatctctg aaacttagaa atgaaatcat cgaggaaaaa ggccatttct 300

tgatcctgtc tgtgttccct gttcccagca taaagcctaa cacgtattag gctaatgtca 360

ccgagcaaag aaagcatcaa agtggcgggt cgggcc 396

<210> 211
<211> 267
<212> DNA
<213> Homo sapiens

<400> 211

tctctagaga cacacagaga gggtgagcgg ctctctcaca cgcaccccag agtcaggcgc 60

gcacgctctc tctctctctc tctatccctc agaaagatct tcctttttcc ctctccctgt 120

gatgtagtga gagtttgatg catatttgtc cgtgtccgcc cccacagacc ctctacctct 180

ctgtgctggc cctatcttgt gtgtatgttt ccctctctct ctcgcgcgcc cacacgatgt 240

actttcttta tatgtagtgc cagttcc 267
<210> 212
<211> 396 
<212> DNA 
<213> Homo sapiens 



<400> 212 

ggcacgagcc aggaggaccc tcgcttcctc tccgccatgc ttgccacctc ttgcttctga 60 

gagtccatct cagttcgcag ttctgtgact tgcattgacc tggctccaat caagctacaa 120

ctcaagcagt cacggggaga aggattgtag atgggccagt gactcacagg gtcaggcact 180

cgggggagcc tgagtcagga ggtcagtggg ccctggaagg gagggggcaa gcctgggtgg 240

gtaaggttct gggccccagg caagaaggca gagtttctcc gcaggggtgt gtgcaagagc 300

tagctgcgca gaaggtctcc gctggctctc caagccgggc ttgtgaaata ggaacgccaa 360 

catcctcctc cacaggcagt ggcaggcacc tcctcc 396 



<210> 213 
<211> 284 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1)...(284)
<223> n = A,T,C or G 



<400> 213 

tgggctgtct cgcccctcct ccctctctct ttgtactcac agtgaaaaat tatagtgttc 60 

gcgtgcgggg cgcgctcttt actttttttt ctctctcaca catatttata tatatagaga 120

gagcctccga gcgctctgcc cccctcctct ctctctctct tcacgtgtgt gcatcaccca 180 

ctcnnnnnnn nnnnctcttc cagagatacg ggggcttgtt tcctccgctc tctctcacac 240 

gtctgtgcag cagaggacta tttttttctt tcccccgcgt ctcn 284 



<210> 214 
<211> 440 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(440)
<223> n = A,T,C or G



<400> 214 

ggcacgaggg attgcagtca gcactttctg aatgttttca cacagtatgc aaagcttaca 60

tcataccaag gagtggagag ttgaagtttc ctcccagtga ctccagtgac agaccacacc 120

tagaaagcgt ttctcttcct gagtatttca aaaagatgta aaagagctgg ggagagtatg 180

ggaagaaaca atacaggatt gcctttaatt aattaagaat tgcctcctga taaaaggaaa 240

aagaaattaa tgctggagta tggaggggtg ataaccttaa agattataaa tatttgttgt 300

ctataaatac ttataaatta taaacacaat ataattaaaa ttagaacatc aggaaaagaa 360

ttaaaatcct caggttgcaa aaccaaaatg ttaaccaaaa caaatactca tgagattcaa 420

ctttgttcac ctatagaaan 440

<210> 215
<211> 439
<212> DNA
<213> Homo sapiens

<400> 215

ggcacgagtg cacaggggac acttacggac acagaaatgc acaggggagg ccgagcataa 60

ccaggggtga ggggcaggca gcagttgtag ttactgccgc ggggcactgc tatgtgcagg 120

gacagccagc acccagccca tcaccactcc ctgggctggc tggcaggtat ggcaccctgg 180

gagcccggca tatacccagg gcacccctac ggctgccgcc agtctcatgc ccaggtgggt 240

gctctgggct ggagcgaggg ccaggttttg ggccgaggct tccccaggca atcctgtgag 300

ctcccttcta gcctctgacc cagtctggtc tggcttgcat ggatgtaggg cttggggtgg 360

gaagttcagg tcctggcttt gcctttgcct gatgtggatg agcagctcac atgctcaggg 420

ccacctgaga ctgtcactg 439

<210> 216
<211> 392
<212> DNA
<213> Homo sapiens



<400> 216 

ggcacgagga gacagagaag tttggccagg gggtccacca tactgctggt caggttggga 60

aggaggcaga gaagtttggc caggtgggga aggaggaaga cagagtggtc caaggcctcc 120

atcatggcgt tagtcaggct ggaagggagg cggggcagtt tggccacgac attcaccaca 180

cagcagggca ggctgggaaa gagggagaca tagcagttca tggtgtccaa cctggggtcc 240

acgaggccgg gaaggaggca gggcagtttg gccagggagt tcaccatacc cttgaacagg 300

ccgggaagga agcagacaaa gcggtccaag ggttccacac tggggtccac caggctggga 360

aggaagcaga gaaacttggc ccaggggtca ac 392



<210> 217 
<211> 394 
<212> DNA 
<213> Homo sapiens 



<400> 217 

ggcacgagcc catctggggc agcaccacgt ggatctctcc ctcgtcacct tcaactggtt 60

cctcgtggtc tttgcggaca gtctcattag caacatcctc cttcgggtct gggatgcctt 120

cctgtacgag gggacgaagg tggtgtttcg ctatgccttg gccattttca agtacaacga 180

gaaggagatc ttgaggctac agaatggcct ggaaatctac cagtacctgc gcttcttcac 240

caagaccatc tccaacagcc ggaagctgat gaacatcgcc ttcaatgaca tgaacccctt 300

ccgcatgaaa cagctgcggc agctgcgcat ggtccaccgg gagcggctgg aggctgagct 360

gcgggagctg gagcagctta aggcagagta cctg 394



<210> 218 
<211> 432 
<212> DNA 
<213> Homo sapiens 



<400> 218 

acacccactt gtttgaggac accatcgatt cgaattcggc acgagcctag ccagcccctg 60

acgtgcctta caggagttct tccagacacg ccaaaagtga cttggactca aagttaaagg 120

caaacctgca gaaaatgatg ttaagcttgt agctctcaaa ctgaaaccac atactaatat 180

catgaggatg gcatctcgag aggagagctt ggaagatgtc ttaggtccac cccctgacaa 240

tgatgatgtt gttaatgact ttgatattga agatgaagta gttgaagtag aaaataggga 300

agaaaaccta ctgaaaattt ctcgcagagc gaaagagtac aaagtggaaa ttttgaatcc 360

tcccagggaa gggaaaaagc ttttggtgct agatgttgat tatacattat ttgaccacag 420

gtcttgtgca ag 432



<210> 219 
<211> 395 
<212> DNA 
<213> Homo sapiens 



<400> 219 

ggcacgagcc ctttactcct ctacccaaga tcttgcttgt ttctttctaa gttgcctctc 60

tatctagctt gcaggatttg agttgaggaa aacacagact tccatgagtt tgggaactac 120

gagagaaaag acagacagag tcaaatctac agcatatctc tcacctcagg aactggaaga 180

tgtattttat caatatgatg taaagtctga aatatacagc tttggaatcg tcctctggga 240

aatcgccact ggagatatcc cgtttcaagg ctgtaattct gagaagatcc gcaagctggt 300

ggctgtgaag cggcagcagg agccactggg tgaagactgc ccttcagagc tgcgggagat 360

cattgatgag tgccgggccc atgatccctc tgtgc 395



<210> 220 
<211> 487 
<212> DNA 
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1)...(487)
<223> n = A,T,C or G 



<400> 220 

tgctcttttg atgatgccat cgattcgaat tcggcacgag cagctagctc agttcaaggt 60 

ggaaatggct taacgagagg aacggcaaca gcaggtggct gaggactacg agctcagact 120

ggcccgggag caagcgcgag tgtgcgaact gcagagtggg aaccagcagc tggaggagca 180

gcgggtggag ctggtggaaa gactgcaggc catgctgcag gcccactggg atgaggccaa 240

ccagctgctc agcaccactc tcccgccgcc caaccctcca gctcctcctg ctggaccctc 300

cagccccggg cctcaggagc ccgagaagga ggagaggagg gtctggacta tgcctcccat 360

ggccgtggcc ctgaagcctg tattgcagca gagccgggaa gcaagggacg agctacctgg 420

agcgcctcct ggtttttgca gntcctcctc agatcttagc ctcctggtgg gcccctcttt 480 

tcagagc 487 



<210> 221 
<211> 365 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(365)
<223> n = A,T,C or G



<400> 221 

ggatgccagt ggtgaggctg taagcgaaac tcttcagttt aaagctcaag atctcttaag 60 

ggcagtccca agatccagag cagagatgta tgatgacgtc cacagcgatg gcagatactc 120 

cctcagtgga tctgtagctc actctagaga tgccggaaga gaaggcctga gaagtgacgt 180 

atttccaggg ccttccttca gatcaagcaa cccttccatc agtgatgaca gctactttcg 240 

caaagaatgt ggccgggatc tggaattttc tcactctgat tctcgggacc aggtcattgg 300 

ccaccggaaa ttggggcatt tccgttctca ggactggaaa tttgcgctcc gtggttcttg 360 

ggaan 365 



<210> 222 
<211> 376 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(376)
<223> n = A,T,C or G



<400> 222 

ggcacgagga gatttcccgg cgggtcccgg cctctgcgtg cacgcgcctg cgtgctcgcg 60 

ctcgcggttc tggcgctgct nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 120

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 180

nnnnnnnnnn nnntgatggc agcagtggcc tgcctgaagc agccactgcc aagaatctag 240 

ctgcagtnnn nnnnnnnnnn nnnnnnnnca tgctccacac agccaccgga agccaagaac 300 

gcaccctcct gggtacagct gcaagccgcc agccgaggct gcggacccgg gcctccctgg 360 

tgctctgggg gttggg 376 



<210> 223 
<211> 399 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(399)
<223> n = A,T,C or G 



<400> 223 

ggcacgaggg gtgacagagc ggctggcgca tgctcagtag agcagcctac ggcaagcagc 60

ctccctcagg gaacatcaca ggaagcagct gcaggacctg agtggacagc accagcagga 120

gctggccagt cagctagctc agttcaaggt ggaaatggca gaacgagagg aacggcaaca 180

gcaggtggct gaggactacg agctcagact ggcccgggag caagcgcgag tgtgcgaact 240

gcagagtggg aaccagcagc tggaggagca gcgggtggag ctggtggaaa gactgcaggc 300

catgctgcag gcccactggg atgaggccaa ccagctgctc agcaccactc ttccgccgcc 360

caaaccttca gcttcttctg cttgaccctc cagccccgn 399

<210> 224
<211> 402
<212> DNA
<213> Homo sapiens

<400> 224

ggcacgaggg cagttcagta tcgatggaca gatcttccta ctctttgact cagagaagag 60

aatgtgggca acggttcatc ctggagccag aaagatgaaa gaaaagtggg agaatgacaa 120

ggatgtggcc atgtccttcc attacatctc aatgggagac tgcataggat ggcttgagga 180

cttcttgatg ggcatggaca gcaccctgga gccaagtgca ggagcaccac tcgccatgtc 240

ctcaggcaca acccaactca gggccacagc caccaccctc atcctttgct gcctcctcat 300

catcctcccc tgcttcatcc tccctggcat ctgaggagaa tcctttagag tgacaggtta 360

aagatgatac caaaaagccc ctgtgagcac ggtcttgatc ag 402

<210> 225
<211> 270
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1)...(270)
<223> n = A,T,C or G

<400> 225

ctctctttct ttctccctcc ccccccgggc gcgctcattt atctcgtctc ttatgtctct 60

ctctctgtgt ctgtgacaga cacactcttt ttcatatagc gcgctccctt ttctttgctc 120

tcgggggggg tctctctgta cgcgtgtgtt ctctctccag tgagtgtgca cgcctaggtg 180

agagagagtn nnnnnnnnnn nnnnntgtgt gtgaatttta tatatttcta tatctctcac 240

tctctgggtg tcacactctc cgtgtgtggg 270

<210> 226
<211> 404
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1)...(404)
<223> n = A,T,C or G

<400> 226

ggcacgagaa ccctcccagg ctaagcccca atttggggct cgcctgccct gcatcaggga 60

gacatgtcag ctgaggagta attgaccaga tttctgcttt agaaatatgg cagtggaggc 120

aggagatggc atctgaggcc caggctgggg agaagggtgc tgggatgaga acctggagtt 180

cagaccaggg aagggatgag agcctaagaa gaggagctct caccctgaga caggctggtg 240

caggagtctg ctcgatccag gcctgggtcc ctggttccct ctgagcttgg gaggactatg 300

tgagacagaa caggaccagg ggcctgcatt cccccttgta ttattcatct tcnnnnnnnn 360

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnn 404



<210> 227 
<211> 389
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(389)
<223> n = A,T,C or G 



<400> 227 

ggcacgagaa gtcactcaac ctctctgagc cttttcttca cctataaagt ggggatagta 60 

actacctacc ttatggaagc atatgaggat tgtgtgaaat catccatgta gcccttccac 120 

cgccacgtgg agtttggcat ggagcagttt ctaaatggaa gtcatcttga tcaggtgggc 180

tgccaacctc tctgagcctc agtttgctct tctagggaat ggggacaatg caatgggaat 240

ctgaggattg tgtgaaattg tgcaaatgca tgaatgtggg ctgggatagt aaaagggagg 300 

gccccggagc agcccacctg gggtcctatc tagtggacgc gcccggtgcc cacccattgc 360 

tgtgatgcca gcagcccact gcaagcatn 389 



<210> 228 
<211> 384 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(384)
<223> n = A,T,C or G 



<400> 228 

ggcacgagct gccacctcta gaaagctgct ttcttctatc accgcttgcc cttgaattat 60 

tccctgaatg aagccaagaa ccctcccagg ctaagcccca atttggggct cgcctgccct 120

gcatcaggga gacatgtcag ctgaggagta attgaccaga tttctgcttt agaaatatgg 180 

cagtggaggc aggagatggc atctgaggcc caggctgggg agaagggtgc tgggatgaga 240 

acctggagtt cagaccaggg aagggatgag agcctaagaa gaggagctct caccctgaga 300 

caggctggtg caggagtctg ctcgatccag gcctgggtcc ctggttccct ctgagcttgg 360 

gaggactatg tgagacagaa cagn 384 



<210> 229 
<211> 292 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(292)
<223> n = A,T,C or G 



<400> 229 

ggtgtctctc tctcgggggg gccccccctc tctctatttt tttttgcgcg cacactcact 60 

ctctctctct tttttccccc gcgcgcgcgc acgcgctttt tttttctttt ttctnnnnnn 120 

nnnnnactct ctctcttttc tcttttgtgt gggggtctcc ggcgcgcttc tctctctctc 180 

tctcacccac agacactctc tctgtgtgcg cacctctctc tctcgggggg ccggatctct 240 

ctcccccctc tctatctctg ttattttggg ggtcccctcc gcgctctcct ca 292 



<210> 230 
<211> 400 
<212> DNA 
<213> Homo sapiens 



<400> 230 

ggcacgaggt gggacagaag tagaagaggg tgaatggccc tggcaggcta gcctgcagtg 60

ggatgggagt catcgctgtg gagcaacctt aattaatgcc acatggcttg tgagtgctgc 120

tcactgtttt acaacatata agaaccctgc cagatggact gcttcctttg gagtaacaat 180

aaaaccttcg aaaatgaaac ggggtctccg gagaataatt gtccatgaaa aatacaaaca 240

cccatcacat gactatgata tttctcttgc agagctttct agccctgttc cctacacaaa 300 

tgcagtacat agagtttgtc tccctgatgc atcctgtgag tttcaaccag gtgatgtgat 360 

gtttgtgaca ggatttggag cactgaaaaa tgatggttac 400 

<210> 231 
<211> 332 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(332)
<223> n = A,T,C or G 



<400> 231 

tatatagaca ccccgccttt tttctctctc tctctataca cacaccgtct ctctcccccg 60

tgtgtctctc ccctctcttt tgctcatact tatatacatc tacacacttg tgtgggggac 120

tctctctagc gctccctctc ttttgtgtgg gcgctctcac acacacacac nnnnnnnnnn 180

nnggagactc ctttctctgt ggagaatatg tgtgcgcacc atctctctct ctcttatttt 240

tccctcgcgc gcgcgctctg tgagagagac tctctgttct cacacatatg atatatatat 300

ccctcccctc tctcacactc gtgccccgcg cn 332



<210> 232 
<211> 407 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(407)
<223> n = A,T,C or G



<400> 232 

ggcacgagaa ctccggctac gtttgctgtc ccaacaaata gaccagggtt ccctaagtgt 60

cgcttcctcc aagaagccct ccctgatgag ttgagccact ttagtttgtg ctcaggctca 120

ccctgcacgt cttggttgct ctcatcactg taatgatcta aaacacacgt ctgctcatga 180

gacccgcatc ccacccccga tgctggggcc gctcttggat tttcatgcct gctgccagca 240

cccaggggga gctccggaaa tgtctgctgg gggctcggaa tacccacctt tctggtaatg 300

cagcccagcg ggtcccagcc tcgttttcca gccctcactc anaatggagt cgctctggtt 360

cgaacgcctc tgancagtgt gtacctacgt gtcaggccca tccttcc 407



<210> 233 
<211> 406 
<212> DNA 
<213> Homo sapiens 



<400> 233 

ggcacgagga aagacccacg tgctgcctca tgtggccgac atcctcagca agtcttgccc 60

ggcacccagg tgagcctctg gtgggggtgg gtagtcacca ctcggctctg gaggatgagg 120

cctgggccat aatccagttg cagggacgga tgatctccat ctcgaaggtc ccagaggtaa 180

ctgcgttgtc ccatcctcca ggcatcccct gcggcgctgg ccaagtgcgt gctggccgag 240

gtcccgaagc aggtggtgga gtactacagc cacagaggcc tgcccccgag aagcctgggt 300

gtccctgccg gagaggccag cccaggctgc acaccgtgaa aatgtggagg gcgtaaaggg 360

ggggcccaga aagaaagtgt cccacacaac ctctgtttgc acatgg 406



<210> 234 
<211> 380 
<212> DNA 
<213> Homo sapiens 



<400> 234

ggcacgagga gggtgaatgg ccctggcagg ctagcctgca gtgggatggg agtcatcgct 60

gtggagcaac cttaattaat gccacatggc ttgtgagtgc tgctcactgt tttacaacat 120

ataagaaccc tgccagatgg actgcttcct ttggagtaac aataaaacct tcgaaaatga 180 

aacggggtct ccggagaata attgtccatg aaaaatacaa acacccatca catgactatg 240 

atatttctct tgcagagctt tctagccctg ttccctacac aaatgcagta catagagttt 300 

gtctccctga tgcatcctat gagtttcaac caggtgatgt gatgtttgtg acaggatttg 360 

gagcactgaa aaatgatggt 380



<210> 235 
<211> 410 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(410)
<223> n = A,T,C or G 



<400> 235 

ggcacgagct gagcaggact tagaggaact ccggctacgt ttgctgtccc aacaaataga 60

ccagggttcc ctaagtgtcg cttcctccaa gaagccctcc ctgatgagtt gagccacttt 120

agtttgtgct caggctcacc ctgcacgtct tggttgctct catcactgta atgatctaaa 180

acacacgtct gctcatgaga cccgcatccc acccccgatg ctggggccgc tcttggattt 240

tcatgcctgc tgccagcacc cagggggagc tccggaaatg tctgctgggg gctcggaata 300

cccacctttc tggtaatgca gcccagcggg tcccagcctc gttntccagc cctcactcan 360

aatggagtcg ctctggttcg aacgcctctg acaagtgtgt acctacgtgt 410



<210> 236 
<211> 394 
<212> DNA 
<213> Homo sapiens 



<400> 236 

ggcacgagac tccggctacg tttgctgtcc caacaaatag accagggttc cctaagtgtc 60

gcttcctcca agaagccctc cctgatgagt tgagccactt tagtttgtgc tcaggctcac 120

cctgcacgtc ttggttgctc tcatcactgt aatgatctaa aacacacgtc tgctcatgag 180

acccgcatcc cacccccgat gctggggccg ctcttggatt ttcatgcctg ctgccagcac 240

ccagggggag ctccggaaat gtctgctggg ggctcggaat acccaccttt ctggtaatgc 300

agcccagcgg gtcccagcct cgttttccag ccctcactca aaatggagtc gctctggttc 360

gaacgcctct gacaagtgtg tacctacgtg tcag 394



<210> 237 
<211> 428 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)...(428)
<223> n = A,T,C or G



<400> 237 

ttcggcacga nnnaagaaga ggccctcaga gatctgacag cctatgagtg cgtggacacc 60

acctcagccc actgagcagg agtcacagca cgaagaccaa gcgcaaagcg acccctgccc 120

tccatcctga ctgctcctcc taagagagat ggcaccggcc agagcaggat tctgccccct 180

tctgctgctt ctgctgctgg ggctgtgggt ggcagagatc ccagtcagtg ccaagcccaa 240

gggcatgacc tcatcacagt ggtttaaaat tcagcacatg cagcccagcc ctcaagcatg 300

caactcagcc atgaaaaaca ttaacaagca cacaaaacgg tgcaaagacc tcaacacctt 360

cctgcacgag cctttctcca gtgtggccgc cacctgccag acccccaaaa tagcctgcaa 420

gaatggcc 428



<210> 238 
<211> 432 
<212> DNA 
<213> Homo sapiens



<400> 238 

tctcatggag gaacccatcc attcgaattc ggcacgagga tcaactggct atcatatctg 60

tttaatacat ttactggagc cagaaaccta ggccatcatc gaacgccagc ccttggtctg 120

agcctgcggc tgtagatgtg gaactcacag catatgcatt gttggcccag cttaccaagc 180

ccagcctgac tcacaaggag atagcgaagg ccactagcat ataggcttgg ttggccaagc 240

aacgcaatgc atatgggggc ttctcttcta ctcacgatac tgtagttgct gtacaagctc 300

ttgccaaata tgccactacc gcctacgtgc catctgagga gatcaacctg gttgtaaaat 360

ccactgagaa tttccagcgc acattcaaca tacagccagc taacagattg gtatttcagc 420

aggataccct gc 432



<210> 239 
<211> 373 
<212> DNA 
<213> Homo sapiens 



<400> 239 

ggcacgaggc aggacctcct ctcccagatc gcccagctgc aggaggagaa caagcagctc 60

atgaccaacc tctcccacaa ggatgtcaac ttctcagagg aggagttcca gaagcatgaa 120

ggcatgtcag agcgggagcg acaggtgatg aacaagctga aggaggtggt ggacaaacaa 180

cgcgacgaga tccgcgccaa ggacagggag ctgggcctga aaaatgagga cgttgaggct 240

ttacagcagc agcagacacg gctgatgaag atcaaccatg accttcggca ccgggtcacg 300

gtggtggagg cccaggggaa agccctgatc gaacagaagg tggagctgga ggcagacctg 360

cagaccaagg agc 373

<210> 240
<211> 392
<212> DNA
<213> Homo sapiens

<400> 240

ggcacgagag ctgaccgaga tggacgtttt ctacatcgcg tcgcttgtgg gccacgagtt 60

cgagcgggtc attgaccagc acgggtgtta ggccatcgcg cgcctcatgc ccaaggtcgt 120

gcgcgtgctg gagatcttgg aggtgctggt cagtcgcctc cacgtcgcgc ccgagctgga 180

cgatctgcgc ctggagcagg acctcctctc ccagatcgcc cagctgctgg aggagaacaa 240

gcagctcatg accaacctct cccacaagga tgtcaacttc tcagaggagg agttccagaa 300

gcatgaaggc atgtcagagc gggagcgaca ggtgatgaag aagctgaagg aggtggtgga 360

caaacaacgc gacgagatcc gcgccaagga cg 392

<210> 241
<211> 434
<212> DNA
<213> Homo sapiens

<400> 241

gatcccatcc attcgaattc ggcacgagga ttgattcacc ttcacctgtg ctgcactcca 60

gctgacccaa gtaggaagcc ggacgagctg taaaacatga acggaagagt ggattatttg 120

gtcactgagg aagagatcaa tcttaccaga gggccctcag ggctgggctt caacatcgtc 180

ggtgggacag atcagcagta tgtctccaac gacagtggca tctacgtcag ccgcatcaaa 240

gaaaatgggg ctgcggccct ggatgggcgg ctccaggagg gtgataagat cctttcggta 300

aatggccaag acctaaagaa cctgctgcac caggatgctg tagacctctt tcgtaatgca 360

ggctatgctg tgtctctgag agtgcagcac aggttacagg tgcagaatgg acctatagga 420

catcgaggtg aagg 434

<210> 242
<211> 385
<212> DNA
<213> Homo sapiens

<400> 242

ggcacgagga gagcgcggac acctcctcaa cccactgaac aggagtcaca gcacgatgac 60

cattcgcaaa gcgacccctg ccctccatcc tgactgctcc tcctaagaga gatggcaccg 120

gccaaaacag gattatgccc ccttctgctg cttctgctgc tgccgctgag tgtggcagag 180

atcccactca gtgccaaacc caagggcatg acctcatcac agtggtttag aattcagcac 240 

atgcagccca gccctcaagc atgcaactca gccatgaaaa acattaacaa gcacacaaaa 300 

cggtgcaaag acctcaacac cttcctgcac gagcctttct ccagtgtggc cgccacctgc 360 

cagaccccca aaatagcctg caaga 385 

<210> 243 
<211> 388 
<212> DNA 
<213> Homo sapiens 



<400> 243 

ggcacgagag aaggcctgcg gcaaagagat gagcttattg acaaacatgg cttagttata 60
atccccgatg gcactcccaa tggtgatgtc agtcatgaac cagtggctgg agccatcact 120
ggtgcgtctc aggaagctgc tcaggtcttg gagtcaccag gagaagggcc attacatgtt 180
tggctacgaa aacttgctgg agagaaggaa gaactactgt cacagattac aaaactgaag 240
cttcagttag aggaggaacg acagaaatgc tccatgactg atggcacagt gggtgacctg 300
gcaggactgc agaatggctc agacttgcag gtcatcgaaa tgcagagaga tgccaataga 360
caaattagcg aatacaaatt taagcttg 388



<210> 244 
<211> 388 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (388) 
<223> n = A,T,C or G 



<400> 244 

ggcacgaggt cactgttgaa gagttcaatc ttaccagagg gccctcaggg ctgggcttca 60
acatcgccgg tgggacagat taccagtatg tctccaacga cagtggcatc tacgtcagcc 120
gcatcaaaga aaatggggct gcggccctgg atgggcggct ccaggagggg gataagatcc 180
tttcggtaaa tggccaagac ctaaagaacc tgctgcacca ggatgctgaa cacctctttc 240
gtaatgcagg ctatgctgtg tctctgagag tgcagcacag gttacaggcg cagaatgtac 300
ctataggaca tcgaggtgaa ggggacccaa gcggattccc atatttattg tgctggtgcc 360
cgnggctggc ctctccctgg tattcgcg 388



<210> 245 
<211> 390 
<212> DNA 
<213> Homo sapiens 



<400> 245 

ggcacgaggc tgtgtgtctc ttttctcacc ccagggcctg gccatgtccc ctttgggaag 60
cctgttccct tacccctaca cgtacatggc cgcagcggcg gccgcctcct ctgcggcagc 120
ctccagctcg gtgcaccgcc accccttcct caatctgaaq accatgcgcc cgcggctgcg 180
ctacagcccc tactccatcc cggtgccggt cccggacggc agcagtctgc tcaccaccgc 240
cctgccctcc atggcggcgg ccgcggggcc cctggacggc aaagtcgccg ccctggccgc 300
cagcccggcc tcggtggcag aggactcggg ctctgaactc aacagacgct cctccacgct 360
ctcctccagc tccatgtcct tgtcgcccag 390



<210> 246 
<211> 397 
<212> DNA 
<213> Homo sapiens 



<400> 246 

ggcacgagac cactgggacc tcctgctcct cgccatcatc aacacagggc tgtctctgtt 60 

tgggctgcct tggatccatg ccgcctaccc ccactccccg ctgcacgtgc gagccctggc 120 

cttagtggag gagcgtgtgg agaacggaca catctatgac acgattgtga acgtgaagga 180 

gacgcggctg acctcgctgg gcgccagcgt cctggtgggc ctgtccctgt tgctgctgcc 240 
ggtcccgctt cagtggatcc ccaagcccgt gctctatggc ctcttcctct acatcgcgct 300
cacctccctc gatggcaacc agctcgtcca gcgcgtggcc ctggtggttc aggaaccaaa 360

ctggggaacc ccccgacaca ctacatcccg gaggggg 397 

<210> 247 
<211> 471 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (471) 
<223> n = A,T,C or G 



<400> 247 

ttacggcgcg tttgttaggg gaccccaccg attcgaattc ggcacgagct ctttttattt 60
tcgctgatat ctttctttta ctaaatgcca ccatccttac ctgttcgggt gtctgcgtgc 120
ctaatttttc ctggctgtta cacaagaacc cggattttag ttgaactctg gagcaaaaat 180
cctgcatcat ttgtaggtgg gtgtcattgt gactggctgc tacctcccca tgagtcttct 240
aaaataaaac ctgcaaattc acatcttccc catgcttcca gagaatgcat attcttcctt 300
tgaaaaaaga aaacnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 420
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnggat g 471



<210> 248 
<211> 403 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (403) 
<223> n = A,T,C or G 



<400> 248 

ggcacgaggt acagacatct agttggcagg agccaaagat gttgccaaac atgtagtann 60
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 120
nnnnnnnnga gtagaggatg cctggtatga ggcaatattt gggataggga agggaagctt 180
gggattttag ctacgtagag acacttgaaa attggaggga ggaaaggagt gggtggcttt 240
ggagatgttc tggaatatgt gaatgagggg agtggagggg ncctgnnngc tctgnggaag 300
gccangcccg gtttcctgtc tttcancctc ttccaggaaa attacgggca gaaagaggct 360
gagaaagtgg tcccggggaa ggcgctttat gaagagcttg gtg 403



<210> 249 
<211> 316 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (316)
<223> n = A,T,C or G



<400> 249 

ccgcttaaag gcgccttctt ttaatgcaat cattttgaac atgtgcgaca gtcgagaata 60
ctaattggat caatcttgat atactctacc taaagacagt ctagaaacct gggggagaaa 120
gaactcaqgg cacaaaacat tgggccgaga acggaattct ctgtaagcct agttgctgaa 180
acttcctgct gtaaccagaa gccagtttta tctatcggct actgaaacac ccactgtgtg 240
ttgctcactc qctcactcac cgaacanaac ctgctacctc cgcatgaatc tactagtgcc 300
gataaactat atcaga 316



<210> 250 
<211> 419 
<212> DNA

<213> Homo sapiens 



<400> 250 

ggcacgagat atcagtcaag ggctcttcaa gacacagcag aaacctcacc gggcctcggg 60 

ctgcctccca ctgggtccca tggccaccac cttgaccttg gaaagctctg ttatatggaa 120

ggtagggagg acactatttc cctcaactac ttctagtaaa aagctcagtt ctctccccag 180 

cagcaagagg gcacctgtga acacctgagt cacagcgcat tcctcctctg cttagaacat 240 

tcgatggctc ccaccttact tgcagtaaat gctgaggtcc ttcctgtggc ccccggggcc 300 

ctgcatgatc tgatccatcc cttacctacc ctcatctctc cactggcctc cccacacttg 360 

ctcccctccg gacactctgg actacttgct gctatctgaa cataccaggc ccctgcccc 419 



<210> 251 
<211> 434 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (434)
<223> n = A,T,C or G



<400> 251 

ggcacgaggg ggcctccacc ggtgactcgg gcctggattc cacggccatg gcctctgccg 60 

ctgcggcgca gggactgtcc ggggcgtccg cggacaccct gcccttccac ctccagcagc 120 

acgtcctggc ctctcagggc ctggccatgt cccctttcgg aagcctgttc ccttacccct 180 

acacgtacat ggacgcagcg gcggccgcct cctctgcggc agcctccagc tcggtgcacc 240 

gccacccctt cctcaatctg aacaccatgc gcccgcggct gcgctacagc ccctactcca 300 

tcccggtgcc ggtcccggac ggcagcagac tgctcaccac cgccctgccc tccatggcgg 360 

cggccgcggn gcccctggac ggcaaagacg ccgccctggc cgccagcccg gcctcggagg 420 

cagtggactc ggcg 434 



<210> 252 
<211> 425 
<212> DNA 
<213> Homo sapiens 



<400> 252 

ggcacgagaa agcactcagc ctggggaatg aactctgcca caatgatgat ggctgtgacc 60 

actccccgca gagagttctt gaagaggagc tcggcaggga ctggcaggcc aaggtggcct 120 

ccttggagga ggtgcccttt gccgctgcct caattgggca ggtgcaccag ggcctgctga 180

gggacgggac ggaggtggcc gtgaagatcc aggtgagagg ggaggctggg cagggtaggg 240 

gcgggcaccc tgctagccca gagaagtgac tcccaccttc tctccctccc ttctcccttt 300 

acagtacccc ggcatagccc agagcattca gagcgatgtc cagaacctgc tggcggtact 360 

caagatgagc gcggccctgc ccgcgggcct gtttgccgag cagagcctgc aggccttgca 420 

gcagg 425 



<210> 253 
<211> 395 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (395) 
<223> n = A,T,C or G 



<400> 253 

ggcacgagca gacatctagt tggcaggagc caaagatgtt gccaaacatg tagtannnnn 60 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 120 

nnnnngagta gaggatgcct ggtatgaggc aatatttggg atagggaagg gaagcttggg 180

attttagcta cgtagagaca cttgaaaatt ggagggagga aaggagtggg tggctttgga 240

gatgttctgg aatatgtgaa tgaggggagt ggaggggtcc tggaggctct ggggaaggcc 300
aagcccgttt tcctgtcttt caacctcttc caggaaaatt acgggcagaa ggaggctgag 360
aaagtggccc gggtgaatgc gctatatgac gagct 395 

<210> 254 
<211> 307 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (307) 
<223> n = A,T,C or G 



<400> 254 

agtcgtcttc ttttaatgta atcattttga acatgtgtga aagttgatca tacgaattgg 60
atcaatcttg aaatactcaa ccaaaagaca gtcgagaagc cagggggaga aagaactcag 120
ggcacaaaat attggtctga gaatggaatt ctctgtaagc ctagttgctg aaatttcctg 180
ctgtaaccag aagccagttt tatctaacgg ctactgaaac acccactgtg ttttgctcac 240
tccctcactc accgatcaaa acctgctacc tccccaagac tttactagtg ccgataaact 300
ttctcan 307



<210> 255 
<211> 312 
<212> DNA 
<213> Homo sapiens 



<400> 255 

agtcgtcttc ttttaatgta atcattttga acatgtgtga aagttgatca tacgaattgg 60
atcaatcttg aaatactcaa ccaaaagaca gtcgagaagc cagggggaga aagaactcag 120
ggcacaaaat attggtctga gaatggaatt ctctgtaagc ctagttgctg aaatttcctg 180
ctgtaaccag aagccagttt tatctaacgg ctactgaaac acccactgtg ttttgctcac 240
tccctcactc accgatcaaa acctgctacc tccccaagac tttactagtg ccgataaact 300
ttctcaaaga gc 312



<210> 256 
<211> 415 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (415) 
<223> n = A,T,C or G 



<400> 256 

ggcacgagca ggagcagctg gcaagggaga aggacacggt gaagatgctg caggaacagc 60
tggaaaaggc agcgcgtgcc tggcgccaaa gcagggcggg aggagtcgag ctgccgggag 120
ccccggggag gcaggaccgg gagaggcaga gctgggcgga gtcgtcaagc tgctgggagc 180
gctgggctgg gagccccagg ggaggcagag ctgggcggag gtagtgggga cagagacttc 240
ctaacgaggg cttcagccca cccggcccac cacccaccct tctggggttc ccttgctggg 300
aagcgagtgt ctgatccccc tgctggccca ggtcctcact ttgcacctgt gtgggcccct 360
tagccagtgc tccagcccct gccctgcagg atgatggttt cccctcagct cccan 415



<210> 257 
<211> 396 
<212> DNA 
<213> Homo sapiens 



<400> 257 

agaaagggtg agtgaggtgc tgtcctgggg ttctccaagt ttgagagcat ggatgcatgt 60 

ggtttgaagc tgaagtgggc ctgggggaat gggttgaagg cagaagcaac cagtttggag 120 

ggaaggcatt tggatatcca gccctttctc tgtggccttg gccctgggtc tgtcctgtta 180

cccccaccca tacctgtctg ctgcgcactc tgtgcttctg tagcattctc gcttctggcc 240
tttaaagttg gcaaggggag gttaataagc acctaggtgg ctgagtgtct ctgtcttctg 300
gcttgttcac aggacttcga gtaagaaggt gatttacagc cagcctagtg cccgaagtga 360 
aggagaattc aaacagacct cgtcattcct ggtgtg 396 

<210> 258 
<211> 431 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (431) 
<223> n = A,T, C or G 



<400> 258 

gnnggagggc ctgcggcaaa gagatgagct tattgagaaa catggcttag ttataatccc 60
cgatggcact cccaatggtg atgtcagtca tgaaccagtg gctggagcca tcactgttgt 120
gtctcaggaa gctgctcagg tcttggagtc agcaggagaa gggccattag atgtaaggct 180
acgaaaactt gctggagaga aggaagaact actgtcacag attagaaaac tgaagcttca 240
gttagaggag gaacgacaga aatgctccag gaatgatggc acagtgggtg acctggcagg 300
actgcagaat ggctcagact tgcagttcat cgaaatgcag agagatgcca atagacaaat 360
tagcgaatac aaatttaagc tttcaaaagc agaacaggat ataactacct tggagcaaag 420
tattagccgg 431



<210> 259 
<211> 404 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (404) 
<223> n = A,T,C or G 



<400> 259 

ggcacgagca ggagcagctg gcaagggaga aggacacggt gaagaagctg caggaacagc 60
tggaaaaggc agcgcgtgcc tggcgccaaa gcagggcggg aggagtcgag ctgccgggag 120
ccccggggag gcaggaccgg gagaggcaga gctgggcgga gtcgtcaagc tgctgggagc 180
gctgggctgg gagccccagg ggaggcagag ctgggcggag gtagtgggga cagagacttc 240
ctaacgaggg cttcagccca cccggcccac cacccaccct tctggggttc ccttgctggg 300
aagcgagtgt ctgatccccc tgctggccca ggtcctcact ttgcacctgt gtgggcccct 360
tagccagtgc tccagcccct gccctgcagg atgatggttt cccn 404



<210> 260 
<211> 402 
<212> DNA 
<213> Homo sapiens 



<400> 260 

ggcacgagat ctccctgcct tgtgagcagc tggccggcgg ctctgggaca ggcggggatg 60
ggagggagtc taccgggcca ctgtagagct ggtagctggg agctggagct gtagagttcc 120
aggctgggag ctggagagcc ctgggtgaga gggaggccta taggggcccc gggggacaca 180
ccaggcttga gggtagtagg tgctggaggc agagcctggc ctgtccaggg tgggacctca 240
cgacccaccc tgtccggccc ccagctcgga ggagcttcta cgtgtatgcg ggcatcctgg 300
cactgctcaa cctactgcag gggctgggga gtgagctgct gtgcttcgac atcatcgagg 360
ggctctggtg cgtgggggcc gcagggagtc tgcctcgtgg gg 402



<210> 261
<211> 402
<212> DNA
<213> Homo sapiens

<400> 261

ggcacgagat ctccctgcct tgtgagcagc tggccggcgg ctctgggaca ggcggggatg 60
ggagggagtc taccgggcca ctgtagagct ggtagctggg agctggagct gtagagttcc 120
aggctgggag ctggagagcc ctgggtgaga gggaggccta gaggggcccc gggggacaca 180
ccaggcttga gggtagtagg tgctggaggc agagcctggc ctgtccaggg tgggacctca 240
cgacccaccc tgtccggccc ccagctcgga ggagcttcta cgtgtatgcg ggcatcctgg 300
cactgctcaa cctactgcag gggctgggga gtgtgctgct gtgcttcgac atcatcgagg 360
ggctctggtg cgtgggggcc gcagggtgtc tgcctcgtgg gg 402

<210> 262
<211> 151
<212> DNA
<213> Homo sapiens

<400> 262

gccgaatatg aagctacgtc cgggtatccg ggttccctgt aattgctttc tgatccctgg 60
tacttagatt tgattaccta tggaccacat tggtagaact actatatggg ggaacctcct 120
gattttgggc ggtctcaaaa acaaaaaaaa c 151

<210> 263
<211> 404
<212> DNA
<213> Homo sapiens



<220> 

<221> misc_feature 
<222> (1) . . . (404) 
<223> n = A, T, C or G 



<400> 263 

ggcacgaggg aacgtggaag gactagactg cctgagtctt ctgannnnnn nnnnnnnnnn 60 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnntt 120 

ggacctaccc cccgagtggt ttgccagggg ctctcaggcc ttcggctaca gactgagggc 180

tgcattatca gctttcctac ttttgaggtt ttgggacttt actggctttc ttgctcctca 240

acttgcagat ggcctgttgt gggacctcac cttgtgatca tgtacatgag ggaaatacac 300 

acccctccca gggatgatgg aaggttaagg tcctaacacc tcctgcacat ctgagcagct 360 

gcacattgaa ccagatagtc ctggaatgtg ggaaaacaga ggcn 404 



<210> 264 
<211> 380 
<212> DNA 
<213> Homo sapiens 



<400> 264 

ggcacgaggg gaacgggaag ccgggaccca gaactcttgt ctttcaggat aaagtggcca 60 

gggtgtacga agccccgggc tttttcctgg acctggagcc catcccggga gccttggacg 120

ctgtgcggga gatgaacgac ctaccggaca cgcaggtctt catctgcacc agccccctgc 180 

tgaagtacca ccactgtgtg ggtgagaagt accgctgggt ggagcagcac ctggggcccc 240 

agttcgtaga acgaattatc ctgacaaggg acaagacggt ggtcttgggg gacctgctca 300 

ttgatgacaa ggacacagct cgaggccagg aggagacccc aagctgggag cacatcttgt 360 

tcacctgctg ccacaatcgg 380



<210> 265 
<211> 440 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (440) 
<223> n = A,T,C or G 

<400> 265 

ggcagaggcg tggacaccac ctcagcccac tgagcaggag tcacagcacg aagaccaagc 60 
gcaaagcgac ccctgccctc catcctgact gctcctccta agagagatgg caccggccag 120

agcaggattc tgcccccttc tgctgcttct gctgctgggg ctgtgggtgg cagagatccc 180

agtcagtgcc aagcccaagg gcatgacctc atcacagtgg tttaaaattc agcacatgca 240 

gcccagccct caagcatgca actcagccat gaaaaacatt aacaagcaca caaaacggtg 300 

caaagacctc aacaccttcc tgcacgagcc tttctccagt gtggccgcca cctgccagac 360 

ccccaaaata gcctgcaaga atggcgataa aaactgccac caaagccacg ggcccgtgtt 420

cctgaccatg tgaagctccn 440 



<210> 266 
<211> 396 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (396) 
<223> n = A,T,C or G 



<400> 266 

gcacgaggag gaacgtggaa ggactagact gcctgagtct tctgannnnn nnnnnnnnnn 60
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnt 120
tggacctacc ccccgagtgg tttgccaggg gctctcaggc cttcggctac agactgaggg 180
ctgcattatc agctttccta cttttgaggt tttgggactt tactggcttt cttgctcctc 240
aacttgcaga tggcctgttg tgggacctca ccttgtgatc atgtacatga gggaaataca 300
cacccctccc agggatgatg gaagggtaag gtcctaacac ctcctgcaca tctgagcagc 360
tgcacattga accagatagt cctggaatgt gggaac 396



<210> 267 
<211> 429 
<212> DNA 
<213> Homo sapiens 



<400> 267 

ggcacgagga tctgacagcc taggagtgcg tggacaccac ctcagcccac tgagcaggag 60
tcacagcacg aagaccaagc gcaaagcgac ccctgccctc catcctgact gctcctccta 120
agagagatgg caccggccag agcaggattc tgcccccttc tgctgcttct gctgctgggg 180
ctgtgggtgg cagagatccc agtcagtgcc aagcccaagg gcatgacctc atcacagtgg 240
tttaaaattc agcacatgca gcccagccct caagcatgca actcagccat gaaaaacatt 300
aacaagcaca caaaacggtg caaagacctc aacaccttcc tgcacgagcc tttctccagt 360
gtggccgcca cctgccagac ccccaaaata gcctgcaaga atggcgataa aaactgccac 420
cagagccac 429



<210> 268 
<211> 405 
<212> DNA 
<213> Homo sapiens 



<400> 268 

ggcacgaggc ggcttcctgg cccgcgagca gtaccgcgcc ctgcggcccg acctggcgga 60
taaagtggcc agtgtgtacg aagccccggg ctttttcctg gacctggagc ccatcccggg 120
agccttggac gctgtgcggg agatgaacga cctaccggac acgcaggtct tcatctgcac 180
cagccccctg ctgaagtacc accactgtgt gggtgagaag taccgctggg tggagcagca 240
cctggggccc cagttcgtag aacgaattat cctgacaagg gacaagacgg tggtcttggg 300
ggacctgctc attgatgaca aggacacagt tcgaggccag gaggagaccc caagctggga 360
gcacatcttg ttcacctgct gccacaatcg gcacctggcc tgccc 405



<210> 269 
<211> 372 
<212> DNA 
<213> Homo sapiens 



<400> 269 

ggcacgagaa ccctgaggcc tggctatggt accaccgggt ggtaggtgcc cagcgctgcc 60
ccatcgtgga caccttctgg caaacagaga caggtggcca catgttgact ccccttcctg 120

gtgccacacc catgaaaccc ggttctgcta ctttcccatt ctttggtgta gctcctgcaa 180

tcctgaatga gtccggggaa gagttggaag gcgaagctga aggttatctg gctgccagcg 240

ggaccaggat ggctattact ggatcactgg caggattgat gacatgctca atgtatctgg 300 

acacctgctg agtacagcag aggtggagtc agcacttgtg gaacatgagg ctgttgcata 360 

ggcacctgtg gg 372 



<210> 270 
<211> 411 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (411)
<223> n = A,T,C or G



<400> 270 

ggcacgagag ctctcggcgc acggcccagc ttccttcaaa atgtctactg ttcacgaaat 60 

cctgtgcaag ctcagcttgg agggtgattg tccaggaagt tattccagat gaagacttat 120

acnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 180

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn n 411 



<210> 271 
<211> 302 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (302) 
<223> n = A,T,C or G 



<400> 271 

ctgagtgtga cactcagaga gtgtgttata tagacagaga gagagcgcgc gcctctgtcc 60 

ccccccttgt gtgtgcccca ctccagtgcg cccagatccg tgcccccccc cggagcgccg 120 

tgctccctnn nnnnnnnnag tgtgcacacc cccctccccc tctcatgagt gcccacatat 180

atattcctgt gtgacccctc cccccccctg ccagtcagtg tccccgccgg agcgcgagtc 240 

actgttttat tttttctcgc ccccaagaag ggatagcgat gtgtctctcc cctcctccca 300 

ca 302



<210> 272 
<211> 429 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (429)
<223> n = A,T,C or G 



<400> 272 

ggcacgagat gtggtacaga catctagttg gcaggagcca aagatgttgc caaacatgta 60 

gtannnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 120

nnnnnnnnnn nnngagtaga ggatgcctgg tatgaggcaa tatttgggat agggaaggga 180

agcttgggat tttagctacg tagagacact tgaaaattgg agggaggaaa ggagtgggtg 240

gctttggaga tgttctggaa tatgtgaatg aggggaagtg gaggggcctg gaggctctgg 300 

ggaaggccaa gcccgttttc ctgtctttca acctcttcca ggaaaattac gggcagaagg 360 

aggctgagaa agtggcccgg gtgaaggcgc tatatgagga gctggaactg tcaacagtgg 420 

tcttgcaaa 429
<210> 273
<211> 471 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (471) 
<223> n = A,T,C or G 



<400> 273 

tgttgtgcat ttgggcatcc caccgattcg aattcggcac gagaaagcat tgaagagacc 60
tcaaggcttt aagaaatgag taggccaaaa tctaagtcaa aggagaatct gtactggggc 120
cccccgtgcc ctgaggtcat tggccaagcc aagccgaacc tgagctttga tcctgatggt 180
ttggggagtg aggaagacag aagtggaagc ccagttctca ccccaagagg ggacacaaat 240
ggatgaccct cccatgatgc tgagacccca aaaggctaca cactcaagct aaaagccaga 300
ggaaatccca tcctgccacc cacaagactt caaggaaagt tgttttggtg ctgagcagag 360
caggggaaga aggaaaacag cccttaagga gctccagcca ctggccagcc ttcatgtgac 420
tctagcccaa attcattccc atcacctggg gtggaagggc cagaaatctc n 471



<210> 274 
<211> 391 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature
<222> (1) . . . (391) 
<223> n = A,T,C or G 



<400> 274 

ggcacgaggt aaactctcta taagtgttca gtgttgacat agcctttgtg catagnnnnn 60 

nnnnnnnnnn nnnnnnnnnn nnnnnnnttt tttgccccac ctggaaaaaa ggggatgcnn 120 

nnnnnnnngg ggggaaaaac aattcttaag ggccctttgg ccataaactt ttttccgggc 180 

cacctttgtt acttttggtc ctggaagggg tttttttggg gggcccacgg ggaggggccc 240 

cataggtaaa ctcggaaaac tttttctaac ccgggttagt gttttaaatt aaaaccaaaa 300 

annnnnnnnn nnttggaatc cttttcttta aaaaaattaa tctctcaaag gaaaacaaag 360

nnnnnnnnnn nggggggccc ctttcgttta g 391 



<210> 275 
<211> 339 
<212> DNA 
<213> Homo sapiens 



<400> 275 

cactccgggg gctctatttg tgtgctctgc acccagtttt ttatacactc cacgctttgg 60
atataacatc tagcgccacg gtgcctatgt gtacacaccc tctctctata tatagatacc 120
tctgtgcgca catatagagg ggaaaagaga gatatatcta ttatatatac atttctacac 180
aactgtctct ggggggtcag agaacgcgcg cacccctctc ttttgagaga aggagactct 240
gtcccccctc tctggggcgc agggaggccc catggcatga agaaaaatac tcacttatat 300
ctctctctct cactctctgt ttgcgaaaaa acacacagg 339

<210> 276
<211> 434
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<222> (1) . . . (434)
<223> n = A,T,C or G

<400> 276

ccctagctac ttgctctttg tgcaggatgc catagattcg tgggctcctg ccttttctca 60

accccgaggt gcctgaccag ttctaccgcc tgtggctatc cctcttcctg cacgccggga 120

tcttgcactg cctggtgtcc atctgcttcc agatgactgt cctgcgggac ctggagaagc 180

tggcaggctg gcaccgcata gccatcatct acctgctgag tggtgtcacc ggcaacctgg 240

ccagtgccat cttcctgcca taccgagcag aggtgggtcc tgctggctcc cagttcggca 300 

tcctggcctg cctcttcgtg gagctcttcc agagctggca gatcctggcg cggccctggc 360 

gtgccttctt caagctgctg gctggggagg cttttctctt cacctttggg ctgctgccgt 420 

ggattgacaa cttn 434 



<210> 277 
<211> 378 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (378)
<223> n = A,T,C or G



<400> 277 

ggcacgagaa aaagtaccgc tccagagcag gagcctaggc agccgagagg gtgcccgaac 60 

ctgagtctga gttgcggcca cttcaggagc tgagaggagc aggatggaac tgcaggatcc 120

aaagatgaat ggagccctcc cttcggatgc tgtgggctac aggcaagaac gtgagggctt 180

cctgcccagt cgtggtcctg ctcctgggag caagccggtc cagttcatgg atttcgaggg 240

gaagacatcg tttggaatgt cagtgttcaa cctcagcaac gccatcatgg gcagcggcat 300 

cctggggctg gcctatgcca tgggccacac gggggtcatt ttctttctgg gcctgctgct 360 

gngccatgcg cttctgcc 378 



<210> 278 
<211> 302 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1) . . . (302) 
<223> n = A,T,C or G 



<400> 278 

cccccnctct cgccnnnnnn nnnnncgttt tcactcccgg gagtcccctt gtttttggcc 60
cggatccggg ttctttcttt cccgtggtgc cgcgggttgg agtgttttat cttttcttca 120
catggggggc tggggagttc cccagaaccc ccagggggaa acccccctcc tatgaaaatg 180
acacatgagc ccctccttcc ggtggcgggg acctgtctct ctaagaccct tttctgggaa 240
aggggtcttt gtttgtatga ccccaccgac gcggggggct ttctatgggc cgcccccccc 300
cg 302

<210> 279
<211> 405
<212> DNA
<213> Homo sapiens

<400> 279

ggcacgaggc ctcattggag acattgacaa tgccatgagg accttcctca actactacac 60
tgtatggaag cagtttgggg ggctcccgga attctacaac attcctcagg gatacacagc 120
ggagaagcga gagggctacc cacttcggcc agaacttatt gaaagcgcaa tgtacctcta 180
ccgtgccacg ggggatccca ccctcctaga actcggaaga gatgctgtgg aatccattga 240
aaaaatcacc aaggtggagt gcggatttgc aacaatcaaa gatctgcgag accacacgct 300
ggacaaccgc atggagtcgt tcttcctggc cgagactgtg aaatacctct acctcctgtt 360
tgacccaacc aacttcatcc acaacaatgg gtgcaccttc gacgc 405

<210> 280
<211> 415
<212> DNA
<213> Homo sapiens

<220> 

<221> misc_feature
<222> (1) . . . (415) 
<223> n = A,T,C or G 



<400> 280 

ggcacgaggg tcacctgtgc tgcccctcct taatctcgta tgatggtcac agtccggtgg 60
ccgtgggggt gctctgcctt ccctggtccc cactgcccat atctgtggac tgccccttcc 120
aaagacccct ggggggggtt ggananattc aatcttacca aactcaacga tccatccatt 180
tcatgttact gatattacat gcggacaccc ctggatcata ttattcaaat ccagtcatct 240
attctgcatt catgaccttt tgataactcc atcatgacct acttgacggt cactgaccat 300
gcttactgga ttccgccttg taacaataaa atctatttaa actnnnnnnn nnnnnnnnnn 360
nnnaccagcc cacataaaat atgattgaat caatttctta taccttcact agaat 415



<210> 281 
<211> 389 
<212> DNA 
<213> Homo sapiens 



<400> 281 

ggcacgaggt agactggggg ctcactgatt gcattgacac ttttcatcat gggtccccgg 60
gggctcacgt ggagtctgac acatgaatac atggctatca tgtctgtcac cttcaatggg 120
gaaaacaaac tttgtaatgg taggaaacac aacaggtaca ataatttaca aaaatatgtt 180
tgccacattt cagggcaagg caaaatgcag aggagacata tgttaaaatc ttatcattca 240
catttgttct ttttatcttt aagatgaagc tcttacacca agtgtcacga gtctggagaa 300
cagatgggtt gaagagctgt tcttataaaa taagatctgg ggaacacaat cctttatata 360
tcaacatcac agtggatttt tggattggg 389



<210> 282 
<211> 371 
<212> DNA 
<213> Homo sapiens 



<400> 282 

ggcacgagat agaatccgag gcattgatat cattaaatgg atggagcgct accttaggga 60 

taagaccgtg atgataatcg tagcaatcag ccccaaatac aaacaggacg tggaaggcgc 120

tgagtcgcag ctggacgagg atgagcatgg cttacatact aagtacattc atcgaatgat 180 

gcagattgag ttcataaaac aaggaagcat gaatttcaga ttcatccctg tgctcttccc 240 

aaatgctaag aaggagcatg tgcccacctg gcttcagaac actcatgtct acagctggcc 300 

caagaataaa aaaaacatcc tgctgcggct gctgagagag gaagagtatg tggctcctcc 360

acgggggcct c 371 



<210> 283 
<211> 413 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (413) 
<223> n = A,T,C or G 



<400> 283 

ggcacgaggt gggagacacc acttgtcttt atgtgggtct caaagatgat gtagaatttc 60 

ctttaatttc tcgcagtctt cctggaaaat attttccttt gagcagcaaa tcttgtaggg 120

atatcagtga aggtctctcc ctccctcctc tcctgnnnnn nnnnnnngga aacaaagttt 180

tgcttttgtt ccccagcctg aaggggaagg gctcaatttt ggttaaccaa aaccttggcc 240 

tccggggtta aagcaattct ccggcctaac cctttggaga acctgggtta ataggcgcag 300 

gcccccaggc cgggttaatt ttgggtttta agaaaaaaca gggtttctca atgtggggca 360 

ggcgtggcca aaacccccac cctaagggga tcggccctcc ttggcctccc aan 413 
<210> 284
<211> 409 
<212> DNA 
<213> Homo sapiens 



<400> 284 

ggcacgaggc ctggggatgc tccctgctaa gtgggcctgc tcccaccctt gccataaagc 60 

tctgaggcag cctgagcctg ccgtgggggc cccactgtga ccctgccgca gtcttcctgg 120

gtccctgcgt cctcttaagg ggcagtgaca cctgcctcgc tggccctgtg tgggtggcag 180

gccccactgt ttgggatatc acatggccag gcacgtggtg agcctgctca gggcggacgc 240

ctgcaggcgc gtgctcggtc acacactgcc ttgtgtggcc ctcctgtccg gtgcagcctg 300 

gacctggacg cctggatcaa tgagccactc tcggacagcg agtcagagga cgagaggccc 360 

agggccgtct tccacgagga ggagcagcgg cgtcccaagc accggccgt 409



<210> 285 
<211> 404 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (404) 
<223> n = A,T,C or G 



<400> 285 

ggcacgagcc acttcacccc cttgggggct gcttattcac tctggggatt cgccatggac 60 

acgtctcaac tgcgcaagct gctgcccatg tttccctgcc cctccagatt gcctggagat 120

ctattttgtt tccttttgtg tttctttttc tgttttgagt gtctttcttt gcaggtttct 180 

gtagccggaa gatctccgtt ccgctcccag cggctccagt gtaaattccc cttccccctg 240 

gggaaatgca ctaccttgtt ttggggggtt taggggtgtt tttgtttttc agnngntttg 300

nttttttggn nnnnnnnnnn gntttgactt ttttnncttt tattttggag ggtaatggaa 360

agaataggaa aatcaggcag gggggagaat ggttgtttat tctt 404



<210> 286 
<211> 441 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature
<222> (1) . . . (441) 
<223> n = A,T,C or G 



<400> 286 

ggcacgaggg aagcgggtgg tgtgtgtccc ctgtttactt ttagctgagc tggggttggg 60
tgtacgggtt ctgttcctct gagccctgcg gcccacctga tgtttacgtg tgtgtgtgag 120
ggggggcggc gctncncnnn caccccccan nggcctctat ccttgtgaag ctctcctcaa 180
tctaatactt attgcccctg actccaaatc ttccaccttt tgcctcttat tatatctatg 240
ttcattaqct taggtcagct gttctctatt atgacactga ttcatacttt tgttttttga 300
taagtactta tttcctctct cattgttgct aatatcctct tccttttttc ctttgtctac 360
tctcacttca tctataaaac tcttacatat ctctccacta atttctttga actaacaatt 420
tttatataga atttaagcct g 441

<210> 287
<211> 387
<212> DNA
<213> Homo sapiens

<400> 287

ggcacgagca gccctggaat tccgcaagca cccggaggcc ggggggtctc cgcgggcgtc 60
ccatgcggag gacatggtgc gccgtgtact cttccccacg acctcaggga ccggtccccc 120
cgccggaact gcttcctacc tggtccggtc ccggcagctg aatctggcca gcccaacctc 180
ccggtcgcta tggcacccac aggcctaaca ttcgcgagtc caccttccgc cgtccgcgag 240
gaaaacctga ttggcgccct cttggcgatc ttcgggcacc tcgtggtcag cattgcactt 300
aacctccaga agtactgcca catccgcctg gcaggctcca aagatccccg ggccttattt 360
aaagaccaaa actggtggct tgggcct 387

<210> 288 
<211> 439 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1) . . . (439) 
<223> n = A, T, C or G 



<400> 288 

ggcacgaggg aggctggaag cgggtggtgt gtgtcccctg tttactttta gctgagctgg 60
ggttgggtgt acgggttctg ttcctctgag ccctgcggcc cacctgatgt ttacgtgtgt 120
gtgtgagggg gggcggcggn nncannnnnn nnnnnnngsin tctttttcca ataacaatat 180
taattaatcc aatctttttt cttcttctct tctttctact ctttttcctc cttttttttt 240
atttactttt actcatcctc ctttcttcat ttactctgtc ttttgtatta ctagcttctt 300
ctctttcgca attttccttt attgttgtca ctcttttggg aataacgtac tcttatgaga 360
agttgtttcc tctttattta catttggttg tcttctcctt tcataattta ttttacgtat 420
gtttgtggag ttttttctt 439



<210> 289
<211> 170
<212> DNA
<213> Homo sapiens

<400> 289

atgagtggtc ttttaattag gaacaaatct aatggaaagg agagttgact gaagttggcc 60
cacaggattg tgagctgggc agagccttca tgaaggcttg ccaccttggg acgcccaatt 120
taatgggggg gcctgctgta aggcaaaagg ctttttggca aattgctggg 170



<210> 290
<211> 393
<212> DNA
<213> Homo sapiens

<400> 290

ggcacgaggt agactggggg ctcactgagt gcagtgacac ttttcatcat gggtccccgg 60
gttctcacgt ggagtctgac acatgaatac atggctatca tgtctgtcac cttcaatggg 120
gaaaacaaac tttgtaatgg taggaaacac aacaggtaca ataatttaca aaaatatgtt 180
tgccacattt cagggcaagg caaaatgcag tggagacata tgttaaattc ttatcattca 240
catttgatct ttttatcttt aggatgaagc tcttacacca agtgtcacga gtctggagaa 300
cagatggggt gagtagttgt tcttataaat tagtatctgt ggaacacaat cctttatata 360
tcaacatcac agtggatttc tggcttggtg cat 393



<210> 291
<211> 430
<212> DNA
<213> Homo sapiens

<400> 291

ggcacgaggg atagaatccg aggcattgat atcattaaat ggatggagcg ctaccttagg 60
gataagaccg tgatgataat cgtagcaatc agccccaaat acaaacagga cgtggaaggc 120
gctgagtcgc agctggacga ggatgagcat ggcttacata ctaagtacat tcatcgaatg 180
atgcagattg agttcataaa acaaggaagc atgaatttca gattcatccc tgtgctcttc 240
ccaaatgcta agaaggagca tgtgcccacc tggcttcaga acactcatgt ctacagctgg 300
cccaagaata aaaaaaacat cctgctgcgg ctgctgagaa aagaaaaaga tgtggctcct 360
tcacgggggc ctcttgccac ccttcaagtg ggtcccttgt gacacccgtc aatcccagat 420
cactgaggcc 430
<210> 292
<211> 423 
<212> DNA 
<213> Homo sapiens 



<400> 292 

atcccatcga ttcgaattcg gcacgaggga agcaagggca cccgccttat ggatggaatt 60
gaggggaagg cacccggggc tcctgcatcg agcttccctc ctatattcaa tgaggaaatg 120
accctgcaga aggctggctg cagatgcccc tgcctcccgg ctttgcctgc ttggagtttg 180
atggacacgt ggtcctgtca gggctacagc aggtctatgg tctttggtaa cggaaagcgc 240
tggtgaaaca gtgagctttc ccgtgggtgc ttttccctga cgccaacaac cagggcaagc 300
tgcctgtcct gctgcttggc cgctcctcag agctgcggcc gggagagttc gtggtcgcca 360
tcggaagccc gttttccctt caaaacacag tcaccaccgg gatcgtgagc accacccagc 420
gag 423



<210> 293
<211> 409
<212> DNA
<213> Homo sapiens

<400> 293
ggcacgaggc taggagtact ggcctagatg gttatagaag tccatgccag gaggtcgtct 60
gcagtcagag ggtggttctg ggctggactc cagccccttc ctgtcggagg ccaatgccga 120
gcggattgtg cagaccttat gtacagttcg aggggccgcc ctcaaggttg gccagatgct 180
cagcatccag gacaacagct tcatcagccc tcagctgcag cgcatctttg agcgggtccg 240
ccagagcgcc gacttcatgc cccgctggca gatgctgaga gttcttgaag aggagctcgg 300
cagggactgg caggccaagg tggcctcctt ggaggaggtg ccctttgccg ctgcctcaat 360
tgggcaggtg caccagggcc tgctgaggga cgggacggat gtgggcgtg 409



<210> 294
<211> 369
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> (1)...(369)
<223> n = A,T,C or G

<400> 294
ggcacgaggc cagctgctgg tggagcggca ctggggactg gaggctggaa gcgggtggtg 60
tgtgtcccct gtttactttt agctgagctg gggttgggtg tacgggttct gttcctctga 120
gccctgcggc ccacctgatg tttacgtgtg tgtgtgaggg ggggcggngn nnnnnnnnnn 180
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnna 240
nnnnnnnnnn ntnaatatat ttttttgttt aatgggtnnn nnnnnnnnnn nnnnnnnnnn 300
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnataaatt 360
attaaattt 369



<210> 295 
<211> 403 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1)...(403)
<223> n = A,T,C or G



<400> 295 

ggcacgagtg cttctctagc tctctaggcc tctccagttt gcacctgtcc ccaccctcca 60 

ctcagctgtc ctgcagcaaa cactccaccc tccaccttcc attttccccc actactgcag 120 

cacctccagg cctgttgcta tagagcctac ctgtatgtca ataaacaaca gctgaagcnn 180 

nnnnnnnnnn nnnnnccccg ccccttaaaa acaatggggg gccgtttacc gaaaacccaa 240 
actggaaaaa acccttggtg gagttggacc accccccacc taaagggcgg ggaaaaaaag 300
gctttattgg aaaaattggg gaggctttgg tttaattgga acccataaaa gccggcaaaa 360 
aacaggtaac caccaccatt ggctttcttt ttaggttcag ggg 403 

<210> 296
<211> 384
<212> DNA
<213> Homo sapiens

<400> 296
ggcacgagga gaacttctgg atcgggccca gctcggaggc cctcatccac ctgggcgcca 60
agttttcgcc ctgcatgcgc caggacccgc aggtgcacag cttcattcgc tcggcgcgcg 120
agcgcgagaa gcactccgcc tgctgcgtgc gcaacgacag gtcgggctgc gtgcagacct 180
cggaggagga gtgctcgcta acaggaatta tgccgtcaaa ctcctttcca cgctggcagt 240
gtgggtgaag tggcccatcc atcccagcgc cccagagctt gcgggccaca agagacagtt 300
tggctctgtc tgccaccagg atcccagggt gtgtgatgag ccctcctccg aagaccctca 360
tgagtggcca gaagacatca ccaa 384



<210> 297
<211> 401
<212> DNA
<213> Homo sapiens

<400> 297
ggcacgagat taagtgaatt gcgttatatt tatgacctta aggaccagat acaggaggta 60
gaagggagat acatgcaggg gcttaaagaa ctaaaggaat ctttgtctga agtggaagaa 120
aaatacaaga aagccatggt ttccaatgca cagttagaca atgagaagaa caatttgatc 180
taccatgtag acacactcaa ggatgttatt gaagagcagg aggaacagat ggcagaattt 240
tatagagaaa atgaagaaaa atcaaaggag ttagaaaggc agaaacatat gtgtagtgtg 300
ctgcagcata agatggaaga acttaaagaa ggcctgcggc aaagagatga gcttattgag 360
aaacatggct taagtataat ccccgatggc actcccaatg g 401



<210> 298
<211> 430
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> (1)...(430)
<223> n = A,T,C or G

<400> 298
aaatgggaga actctgaggt ncccaacgat tcgcattcgg cacgaggcca gctgctgggg 60
gagcggctct ggggactgga ggctggaagc gggtggtgtg tgtcccctgt ttacttttag 120
ctgagctggg gttgggtgta cgggttctgt tcctctgagc cctgcggccc acctgatgtt 180
tacgtgtgtg tgtgaggggg ggcgggnnac gntatanacc catcttatta tcaaattaca 240
aaatcccant aataggtatc tccatcaagc tgcangagga ggagagagaa atgagagaca 300
attatgttcc tgtggtctca gccttggatc aggagattat tgaagatgat tcttgcccta 360
aggagatgct gaagcttttg gactttgggg gtctgttcaa ccttcatggt acttcaactt 420
cagctgggaa 430



<210> 299
<211> 387
<212> DNA
<213> Homo sapiens

<400> 299
ggcacgaggt ttcatacctt ctcagaattg gtatatcaag acacatttaa atataagccc 60
tctggaaatg gatttatata cagtcatcat aattaccccc ttagaaattg gtaatatttt 120
atagccaggt ttaggtttag tgtcaagtat agtgattgct ggtctatcac tactcatgaa 180
gtggaacccc ctctactcat aaaaacccca atcagacata tagatgaata gaaccttgat 240
aacattagaa tgccttgttc tctgaaggct tacaagacta tacgtcagga tatattaagg 300
agaagctgag gaacgaaaga aacttcgaca agagaatgga aatgtacatg ctatagcata 360
actgaagaat aaaatacagg tttgagg 387

<210> 300
<211> 373
<212> DNA
<213> Homo sapiens

<400> 300
ggcacgagac tagtccgact ttttatgtgc tatgcaaaat agacatcttt aacatagtcc 60
tgttactatg gtaacacttt gctttctgaa ttggaaggga aaaaaatgta acgacagcat 120
tttaaggttg ccatggtaac cagccacagt acatatgtaa ttctttccat caccccaacc 180
tctcctttct gtgcattcat gcaagagttt cttgtaagcc atcagaagtt acttttagga 240
tgggggagag gggcgagaag gggaaaaatg ggaaatagtc tgattttaat gaaatcaaat 300
gtatgtatca tcagttggct acgttttggt tctatgctaa actgtgaaaa atcagatgaa 360
ttgataaaag agt 373



<210> 301
<211> 369
<212> DNA
<213> Homo sapiens

<400> 301
ggcacgagac tagtccgact ttttatgtgc tatgcaaaat agacatcttt aacatagtcc 60
tgttactatg gtaacacttt gctttctgaa ttggaaggga aaaaaatgta gcgacagcat 120
tttaaggttg ccatggtaac cagccacagt acatatgtaa ttctttccat caccccaacc 180
tctcctttct gtgcattcat gcaagagttt cttgtaagcc atcagaagtt acttttagga 240
tgggggagag gggcgagaag gggaaaaatg ggaaatagtc tgattttaat gaaatcaaat 300
gtatgtatca tcagttggct acgttttggt tctatgctaa actgtgaaaa atcagatgaa 360
ttgataaaa 369



<210> 302
<211> 399
<212> DNA
<213> Homo sapiens

<400> 302
ggcacgaggc agcagacacg gctgatgatg atcaaccatg accttcggca ccgggtcacg 60
gtggtggagg cccaggggaa agccctgatc gaacagaagg tggagctgga ggcagacctg 120
cagaccaagg agcaggagat gggcagcctg cgagcagagc tggggaagtt gcgagagagg 180
ctgcaggggg agcacagcca gaatggggag gaggagcctg agacggagcc ggtgggagag 240
gagagcatct ccgacgcaga gaaggtggcc atggatctca aggaccccaa ccgcccccgg 300
ttcaccctgc aggagctgcg ggacgtgctg cacgagagga acgagctcaa gtccaaggtg 360
ttcttgctgc aggaggagct ggcttactat aagagtgag 399



<210> 303 
<211> 391 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> (1)...(391)
<223> n = A,T,C or G 



<400> 303 

ggcacgagca cagcccctga ctgccgcagc ccccacagag cccgccgcgc accccacgtc 60 

ccccacgcca gcgcccagcc atggaggcca tcaagnnnnn nnnnnnnnnn nnnnnnnngg 120 

acaaggagaa tgccatcgac cgcgcggagc aggcggaggc ggataagaaa gccgctgagg 180 

acaagtgcaa gcaggtggag gaggagctga cgcacctcca gaagaaacta aaagggacag 240 

aggacgagct ggataaatat tccgaggacc tgaaggacgc gcaggagaag ctggagctca 300 

cggagaagaa ggcctccgac gctgaaggtg atgtggccgc cctcaaccga cgcatccagc 360 

tcgttgagga ggagttggac agggctcang a 391 
<210> 304
<211> 418
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> (1)...(418)
<223> n = A,T,C or G

<400> 304
ggcacgagtg ccgcagcccc cacagagccc gccgcgcacc ccacgtcccc cacgccagcg 60
cccagccatg gaggccatca agnnnnnnnn nnnnnnnnnn nnnnnggaca aggagaatgc 120
catcgaccgc gcggagcagg cggaggcgga taagaaagcc gctgaggaca agtgcaagca 180
ggtggaggag gagctgacgc acctccagaa gaaactaaaa gggacagagg acgagctgga 240
taaatattcc gaggacctga aggacgcgca ggagaagctg gagctcacgg agaagaaggc 300
ctccgacgct gaaggtgatg tggccgccct caaccgacgc atccagctcg ttgaggagga 360
gttggacagg gctcangaac gactggccac ggccctgcag aagctggagg aggcagaa 418



<210> 305
<211> 420
<212> DNA
<213> Homo sapiens

<400> 305
ggcacgagga tttcggcaac aatttacaca gctggctgga ccagacatgg aggtgggtgc 60
cactgatctg atgaatattc tcaacaaagt cctttctaag cacaaagatc ttaagactga 120
cggttttagt cttgacacct gccggagcat tgtgtctgtc atggacagtg acacgactgg 180
taagctgggc tttgaagaat ttaagtatct gtggaacaac atcaagaaat ggcagtgtgt 240
ttataagcag tatgacaggg accattctgg gtctctggga agttctcagc tgcggggagc 300
tctgcaggcc gcaggcttcc agctaaatga acaactttac caaatgattg tccgccggta 360
tgctaatgaa gatggagata tggattttaa caatttcatc agctgcttgg tccgcctgga 420



<210> 306
<211> 399
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> (1)...(399)
<223> n = A,T,C or G

<400> 306
ggcacgagcc acgtccccca cgccagcgcc cagccatgga ggccatcaag nnnnnnnnnn 60
nnnnnnnnnn nnnggacaag gagaatgcca tcgaccgcgc ggagcaggcg gaggcggata 120
agaaagccgc tgaggacaag tgcaagcagg tggaggagga gctgacgcac ctccagaaga 180
aactaaaagg gacagaggac gagctggata aatattccga ggacctgaag gacgcgcagg 240
agaagctgga gctcacggag aagaaggcct ccgacgctga aggtgatgtg gccgccctca 300
accgacgcat ccagctcgtt gaggaggagt tggacagggc tcaggaacga ctggccacgg 360
ccctgcagaa gctggaggag gcagaanaag ctgcagatg 399



<210> 307
<211> 438
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> (1)...(438)
<223> n = A,T,C or G

<400> 307
atcccatcga ttcgaattcg gcacgagccc ccacagagcc cgccgtgcac cccacgtccc 60
ccacgccagc gcccagccat ggaggccatc aagnnnnnnn nnnnnnnnnn nnnnnnggac 120
aaggagaatg ccatcgaccg cgcggagcag gcggaggcgg ataagaaagc cgctgaggac 180
aagtgcaagc aggtggagga ggagctgacg cacctccaga agaaactaaa agggacagag 240
gacgagctgg ataaatattc cgaggacctg aaggacgcgc aggagaagct ggagctcacg 300
gagaagaagg cctccgacgc tgaaggtgat gtggccgccc tcaaccgacg catccagctc 360
gttgaggagg agttggacag ggctcatgaa cgactgggca cggacctgca gaagctggag 420
gagggcagaa aaagctgc 438

<210> 308
<211> 419
<212> DNA
<213> Homo sapiens

<400> 308
ggcacgagct ttggcctgcc cgctcctctc ctttctggcg acccgactct ggctacgcaa 60
cggggcccgc gtcaatgcct gggcctactg ccacgtgcta cccactgggg acctgctgct 120
ggtgggcacc caacagctgg gggagttcca gtgctggtca ctagaggagg gcttccagca 180
gctggtagcc agctactgcc cacaggtggt ggaggacggc gtggcagacc aaacagatga 240
gggtggcagt gtacccgtca ttatcagcac atcgcgtgtg agtgcaccac ctggtggcaa 300
ggccagctgg ggtgcagaca ggtcctactg gaaggagttc ctggtgatgt gcacgctctt 360
tgtgctggcc gtgctgctcc cagttttatt cttgctctac cggcaccgga acagcatgg 419

<210> 309
<211> 415
<212> DNA
<213> Homo sapiens

<400> 309
ggcacgaggc tgagccagag acgccctcca ttctctcttc gcgcccgctc tccggctggc 60
ctcccgatgc gctgcccgcc ctgccaccat gacggaacag gccatctcct tcgacaaaga 120
cttcttggcc ggaggcatcg tcgccgtcat cttcaagacg gacgtggctc ctatcgagcg 180
ggtcaagctg ctgctgccgt ccagcacgcc agcaagcaga tcgccgccga ctagcagtac 240
aagggcatcg tggactgcat tgtccgcatc cccaaagagc atggagtgct gtccttctgg 300
aagggcaacc ttgccaacgt caatcgctac ttccccactc aagccctcaa cttcgtcttc 360
aaggataatg acatgcagat cttactgggg ggcgtggaca aacacacgca ggtct 415

<210> 310
<211> 396
<212> DNA
<213> Homo sapiens

<400> 310
ggcacgagcg ggtcctgccg gtgccacatg gggtaccagg gcccgctgtg cactgactgc 60
atggacggct acttcagctc gctccggaac gagacccaca gcatctgcac agcctgtgac 120
gagtcctgca agacgtgctc gggcctgacc aacagagact gcggcgagtg tgaagtgggc 180
tgggtgctgg acgagggcgc ctgtgtggat gtggacgagt gtgcggccga gccgcctccc 240
tgcagcgctg cgcagttctg taagaacgcc aacggctcct acacgtgcga agagtgtgac 300
tccagctgtg tgggctgcac aggggaaggc ccaggaaact gtaaagagtg tatctctggc 360
tacgcgaggg agcacggaca gtgtgcagat gtggac 396

<210> 311
<211> 394
<212> DNA
<213> Homo sapiens

<400> 311
ggcacgaggc ctctgggccc tacagctcat cctggtcacg tgcccctcac tgctcgtggt 60
catgcacgtg gcctaccgcg aggaacgcga gcgcaagcac cacctgaaac acgggcccaa 120
tgccccgtcc ctgtacgaca acctgagcaa gaagcggggc ggactgtggt ggacgtactt 180
gctgagcctc atcttcaagg ccgccgtgga tgctggcttc ctctatatct tccaccgcct 240
ctacaaggat tatgacatgc cccgcgtggt ggcctgctcc gtggagcctt gcccccacac 300
tgtggactgt tacatctccc ggcccacgga gaagaaggtc ttcacctact tcatggtgac 360
cacagctgca tggagatctt cggccccagg cacc 394

<210> 312 
<211> 384 
<212> DNA 
<213> Homo sapiens 



<400> 312 

ggcacgaggc gaggaacgcg agcgcaagca ccacctgaaa cacgggccca atgccccgtc 60 

cctgtacgac aacctgagca agaagcgggg cggactgtgg tggacgtact tgctgagcct 120 

catcttcaag gccgccgtgg atgctggctt cctctatatc ttccaccgcc tctacaagga 180 

ttatgacatg ccccgcgtgg tggcctgctc cgtggagcct tgcccccaca ctgtggactg 240 

ttacatctcc cggcccacgg agaagaaggt cttcacctac ttcatggtga ccacagctgc 300 

catctgcatc ctgctcaacc tcagtgaagt cttctacctg gtgggcaaga ggtgcatgga 360 

gatcttcggc cccaggcacc ggcg 384 



<210> 313
<211> 430
<212> DNA
<213> Homo sapiens

<220>
<221> misc_feature
<222> (1)...(430)
<223> n = A,T,C or G

<400> 313
ggcacgagcc ggctcgtaag caacctcttc agtctgcagt gggacccgcg cgtcatgcag 60
cgtgccagca gcaacctgca ccgcggtccg ggcggggcgc tggtctttct ggacaatgag 120
gcgggcttgg tgcacggcta ccgggtagca ggcatgtggg acaagtataa cgagccgctg 180
ttgcagtcag tgtgcgtgtt ccgcgagcgg accgcgcggc gcgtcctgga gctgcaccgc 240
ggacaggacg ccgcggcccg gctgctgcgc ctctaccggc gccacgagcc tcgcttcccc 300
gagctggccg cccttgcaga cccccacgct cagctgctac agcgccgcct cgacttcctc 360
gccaagcaca ttttgcactg taaggccaag tacggccgcc ggtctgggac ttagtgtcac 420
cgggaggaan 430



<210> 314
<211> 408
<212> DNA
<213> Homo sapiens

<400> 314
ggcacgagag cagaaggact ttgtctgcaa caccaagcag cccggctgcc ccaacgtctg 60
ctatgacgag ttcttccccg tgtcccacgt gcgcctctgg gccctacagc tcatcctggt 120
cacgtgcccc tcactgctcg tggtcatgca cgtggcctac cgcgaggaac gcgagcgcaa 180
gcaccacctg aaacacgggc ccaatgcccc gtccctgtac gacaacctga gcaagaagcg 240
gggcggactg tggtggacgt acttgctgag cctcatcttc aaggccgccg tggatgctgg 300
cttcctctat atcttccacc gcctctacaa ggattatgac atgccccgcg tggtggcctg 360
ctccgtggag ccttgccccc acactgtgga ctgttacatc tcccggcc 408



<210> 315 
<211> 412 
<212> DNA 
<213> homo sapiens 



<400> 315 

tcggagccca tgcgcagcgg ggcgcgttag ctcgcgctct tcctgacccc cgatcctggg 60 

gccgaggtac ctttgacagg agcgtgaccc tgctggaggt gtgcgggagc tggcctgagg 120 

gcttcgggct gcggcacatg tcctccatgg agcacacgga ggagggcctc cgggagcgac 180 

ttgccgacgc catggccgag tcacctagcc gggacgtcgt gggatccgga acagaacttc 240 

agcgagaggg aagcatcgag actctgagta acagctcagg ctccaccagc ggcagcatac 300 

caagaaactt tgatggctac cgatctccgc tgcccaccaa tgagagccag cccctcagcc 360 

tcttcccgac tggcttcccg taggtaccag caacctgctt ctgactggcc ag 412 



WO 01/72781 



PCT/US01/09952 



<210> 316
<211> 300
<212> DNA
<213> homo sapiens

<400> 316
gccagcccct cagcctcttc ccgactggct tcccgtaggt accagcaacc tgcttctgac 60
tggccagccc cctcccctgc tggaggaggg gagaagcccc gctctggtcc tacccttcag 120
tctctgctct tccttcatca accaccttcc ccaagcttag tgacagcagc cgcccatcct 180
acctggatgg agaagagacc cttctccaag cacctcagcg cacttgccct ctgccacacc 240
tgtcggtgga ggctgtggcc aggagagact gtagaagctc ggtccctgtg tatgtttgca 300

<210> 317
<211> 2064
<212> DNA
<213> homo sapiens

<400> 317
acctcagcca gattcggcac gaggggcgta ggaccctccg agccaggtgt gggatatagt 60
ctcgtggtgc gccgtttttt aagccggtct gaaaagcgca atattcgggt gggagtgacc 120
cgattttcca ggctgctatc catgtccagg gccaaacatg aatcctattg ctcttgggga 180
gccgctggct tgcttatgca gaaaacaagt tgattcgatg tcatcagtcc cgtggtggag 240
cctgtggaga caacattcag tcttatactg ccacagtcat tagtgctgct aaaacattga 300
aaagtggcct gacaatggta gggaaagtgg tgactcagct gacaggcaca ctgccttcag 360
gtgtgacaga agatgatgtt gccatccaca gtaattcacg gcggagtcct ttggtcccag 420
gcatcatcac agttattgac accgaaaccg ttggagaggg ccaggtgctt gtgagtgagg 480
attctgacag tgatggcatt gtggcccact tccctgccca tgagaagcca gtgtgctgca 540
tggcttttaa tacaagtgga atgcttctag tcacaacaga cacccttggc catgactttc 600
atgtcttcca aattctgact catccttggt cctcatcaca atgtgctgtc caccatctgt 660
atactcttca caggggagaa actgaagcca aagtacagga catctgcttc agccatgact 720
gtcgctgggt tgtggtcagt actctccggg gtacttccca cgttttcccc atcaaccctt 780
atggtggcca gccttgtgtt cgtacacata tgtcaccacg agtagtgaat cgcatgagcc 840
gtttccagaa aagtgctgga ctggaagaga ttgaacaaga actgacgtct aagcaaggag 900
gtcgctgtag ccctgttcca ggtctatcaa gcagcccttc tgggtcaccc ttgcatggga 960
aactgaacag ccaagactcc tataacaatt ttaccaacaa caaccctggc aaccctcggc 1020
tctctcctct tcccagcttg atggtagtga tgcctcttgc acaaatcaag cagccaatga 1080
cattggggac catcaccaaa cgaaccggca aagttaaacc tcctccacaa atttcaccca 1140
gcaaatcgat gggcggagaa ttttgtgtgg ctgctatctt cggaacatcc aggtcatggt 1200
ttgcaaataa tgcaggtctg aaaagagaaa aagatcagtc caaacaagtt gtagttgagt 1260
ccctgtacat tatcagttgc tatggcacct tagtggaaca catgatggag ccgcgacccc 1320
tcagcactgc acccaagatt agtgacgaca caccactgga aatgatgaca tcgcctcgag 1380
ccagctggac tctggttaga acccctcaat ggaatgaatt gcagccaccg tttaatgcaa 1440
accaccctct gctcctcgct gcagatgcag tacagtatta tcagttcctg cttgctggcc 1500
tggttccccc tggaagtcct gggcccatta ctcgacatgg gtcttacgac agtttagctt 1560
ctgaccatag tggacaggaa gatgaagaat ggctttccca ggttgaaatt gtaacacaca 1620
ctggacccca tagacgtctg tggatgggtc cacagttcca gttcaaaacc atccatccct 1680
caggccaaac cacagttatc tcatccagtt catctgtgtt gcagtctcat ggtccgagtg 1740
acacgccaca gcctcttttg gattttgata cagatgatct tgatctcaac agtctcagga 1800
tccagccagt ccgctctgac cccgtcagca tgccagggtc atcccgtcca gtctctgatc 1860
gaaggggagt ttccacagtg attgatgctg cctcaggtac ctttgacagg agcgtgaccc 1920
tgctggaggt gtgcgggagc tggcctgagg gcttcgggct gcggcacatg tcctccatgg 1980
agcacacgga ggagggcctc cgggagcgac ttgccgacgc catggccgag tcacctagcc 2040
gggacgtcgt gggatccgga acag 2064

<210> 318
<211> 1365
<212> DNA
<213> homo sapiens

<400> 318
cgagaactct gagtaacagc tcaggctcca ccagcggcag cataccaaga aactttgatg 60
gctaccgatc tccgctgccc accaatgaga gccagcccct cagcctcttc ccgactggct 120
tcccgtaggt accagcaacc tgcttctgac tggccagccc cctcccctgc tggaggaggg 180
gagaagcccc gctctggtcc tacccttcag tctctgctct tccttcatca accaccttcc 240
ccaagcttag tgacagcagc cgcccatcct acctggatgg agaagagacc cttctccaag 300
cacctcagcg cacttgccct ctgccacacc tgtcggtgga ggctgtggcc aggagagact 360
gtagaagctc ggtccctgtg tatgtttgca tatgacatcc tgcattggat ccgcttttgt 420
attttttaac catacccacg gtggggcggg tggggggagc ctggaacagt gaccagatct 480
gggggcctga gtggggacag agttgatcgt ccacctggcc attttgaccc tgagtggaca 540
gtcacagcct cagctcatgt ctggctgtga cacacactgc ccccagcttc ccttggtcag 600
ccccactcca gcacggggtg aacggaggcc cagagtacta gggaaggagg aagggaggac 660
atgcctcttc ttcctccttt ctttccccat ctgttcctgg gaagagtttg tctttcttat 720
ctttaagccc ctttaccctg gtcctgtact gatcagtgaa ggaaaccgtg gttactgagg 780
ccctgttgaa aagtgcacgt cttgtccaat aaatcacgct gcagttggaa aaaaaaaaaa 840
aaaaaaaaag gatctttaat taagcggccg caagcttatt ccctttagtg agggttaatt 900
ttagcttggc actggccgtc gttttacaac gtcgtgactg gtaaaccctg gcgttaccca 960
acttaatcgc cttgcagcac atcccccttt cgccagctgg cgtaatagcg aagaggcccg 1020
caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatgggacg cgccctgtag 1080
cggcgcatta agcgcggcgg gtgtggtggt taccgcgcag cgtgaccgct acacttgcca 1140
gcgccctagc gcccgctcct ttcgctttct tccccttcct ttttcgccac gttcgccggc 1200
tttcccccgt caagctctaa atcgggggct cccctttagg gttcccgatt tagtgcttta 1260
ccggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg ggccatcgcc 1320
ctgataagac ggtttttcgc cctttgacgt tggagtccac gttct 1365



<210> 319
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> synthesized primer

<400> 319
tgggatatag tctcgtggtg cg 22

<210> 320
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> synthesized primer

<400> 320
tgattcgatg tcatcagtcc cg 22

<210> 321
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> synthesized primer

<400> 321
tgtgtcacag ccagacatga gc 22

<210> 322
<211> 21
<212> DNA
<213> Artificial Sequence

<220>
<223> synthesized primer

<400> 322
tgcaaacata cacagggacc g 21

<210> 323
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> synthesized primer

<400> 323
tttagcagca ctaatgactg tggc 24

<210> 324
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> synthesized primer

<400> 324
cgccgtgaat tactgtggat gg 22